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651 


CCAGCGAACT 




701 


CCACACGACA 




751 
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TGGGGTCAGT 
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GGGCCTCCTA 
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2001 


GACAAAGCGC 




2051 


CAACCACACT 


2101 


TGCTATGTTC 




2151 


TAGGAGGCCC 




2201 


TCCTGCCCAC 




2251 


AAGCCACACA 




2301 


GTGTTGAGTA 




2351 


AAGGATAAGA 




2401 


ACCATCAAAC 




2451 


GGGTGGCTCA 




2501 


AGGAGAGCAG 




2551 


AACTCTAGGA 




2601 


GACTCCTTGG 




2651 


TAGGTGATGC 




2701 


ATATTGGGCA 




2751 


AGATCAGGCA 




2801 


AAGAAGGACT 




2851 


TGGAATCAGG 




2901 


TGGGAAAAGG 




2951 


ACGCTCACCC 




3001 


TGTTAAGCAG 




3051 


CTGTCTTGAG 




3101 


CCACTATCTC 



CCGCCCGCAG TTCCCGGCCC 
ATGTGCTGTT CCGGCTGCTG 
TGCCTGCTGT GCCGCTTCAC 
CACCATCGGT GTTGACTTTA 
AAGTGCGGAT ACAGATCTGG 
ATCACAAAGC AGTACTATCG 
CATTAGCAGC GAGCGCTCTT 
TGGATGAGTA CGCACCAGAA 
GCTGATGAGG AGCAGAAACG 
GGCGAAGGAG TATGGCATGG 
TCAAGATTAA AGAGTCATTC 
CATAGGAAGG AGCTGGAAGG 
ACTGGCAGAG CTGGAGGAGG 
CTTCGAAAAC CTGCTGGTGC 
CCCCTCTTCC CTCAGGAGGC 
CCCTGCTGCT GTCCTCTCGT 
TACTCCCCCT GCCTGGCCCT 
CCCTGTCCCC AGCCCGTCCA 
CCAGCCACAG GCCTGCTACG 
ACCATCCCGC ACCCACCAGA 
TCAGCTGCTC CTTTCTCCGT 
TTCCCCCACT TCTCTTTCTC 
AAAGCTCCTC AAACCCCGTC 
CTTTCCTTCC TTCCTAAGCT 
AGGTTCCACC CTTGGATCCA 
GGCCAAAGGC TACAGGGTGC 
TCATGTGCCA TGGGCCTGCC 
AGGTAGGAGG GAAACAGCAA 
ACCTCTACCC ATTCCCCGAC 
TGCCTCTTTG GGGAACTGAG 
AAAATGAGGG GTGGCAGGGA 
TGCAGCATGA ACACAATTTC 
GGACCAAGGC CTGTGGGTAC 
GACACTGTCA GGCCAACCAT 
AAGGACTAGC TCCTATGTAT 
GACAGCAGTT TGCACAGCAG 
GGCCCCATCT TCCATTTCTT 
GGCGGAGGAC TGAGCCTAGC 
AAGTTTATGG GAGAAGGGCA 
CTGGGTAGAA ACGCTTGGTG 
TCATCTTGCC CTCTACCTCC 
GTGGCTACAG GTGGAGGGAA 
AGGAAGTTTC TTTAACCCCA 
TTTAAAGACG GAACAAGTAA 
CGTCCCAAGG TGGGCGAGGC 
TTAGCTGCAC TGCGTGGCTG 
TTCATCAACT AAGAACCAAA 
ACCTCACTTC CTTACTCCTC 
CTTTCCTCCT GACTTACCAA 
GGACTTAGGG ACAGGGTATA 
GAAGTTGTAA ATCACTGGCT 
TGAAGCTGTG ACTAGGCTGG 
GTATCAAGAC CTATGCCACA 
CAGCCCTCAA GTCTGTCTTC 
GTGGCTCCAA TCTGTGGACC 
GGGTGCCATT CATTGTCTTT 
ATATTTGTAC TGTACCCTAG 
ATTCTATAGA CTGATAGGCC 
TATGGGAGCA GAAGGAGAAT 
TAAACTTTTG GTGGCCTGGG 
GCTCTGCTGG CCTGTTTACT 
ACTCCATCCA GCCCCAGGCA 
CCTGTGACGG GTGAACTTCG 



CGCTGGCCCC AGTCATGGCG 
CTGATCGGGG ACTCCGGGGT 
CGACAACGAG TTCCACTCCT 
AGATGAAGAC CATAGAGGTA 
GACACTGCAG GGCAGGAGAG 
GCGGGCCCAG GGGATATTTT 
ACCAGCACAT CATGAAGTGG 
GGCGTCCAGA AGATCCTTAT 
GCAGGTGGGA AGAGAGCAAG 
ACTTCTATGA AACAAGTGCC 
ACGCGTCTGA CAGAGCTGGT 
CCTCCGGATG CGTGCCAGCA 
AGGAGGGCAA ACCCGAGGGC 
TGAGTCCTGT GTGGGGCACC 
CCGTGGGCAG ACAGGGGAGC 
GTGATGACCC TATTGAGTAT 
GAGAGCGGCT CTGCTGTCAT 
CCCTGGAGTG GTCTTCTTCA 
ACCCCCACGA TGTGCCGCAA 
CAACAGCCAG GGCTGGAGTC 
GCATCGTGTC TCTTCTCTGC 
TGACCCCTCC CCTCCGGTGC 
CCCCGTGTGT CCTGCTGTGT 
ATCCAAGGGG ATGGACCCAG 
GGAAGAACCC TCCACCCTGC 
TTCTTCCTCT TCCCCCACCC 
TCCCCAGTGA CCTGCGAAAG 
CCGGGGAGTC CTCGAGCCTG 
CAGAGCTTTG CCCTTGCTTG 
CTCAGAGGCA GGTGCTTCAG 
TAAAAAGTCA CCTCCATTCT 
TCTCCACCTG GCTCCCAAAT 
TCCAGGGGCA AGGAGAGCCC 
GCACTCCACA AAGGGGAGCA 
CAGGTTAAGA GCAAGGGAGA 
AGGGGAATGT AGCAACAGCA 
AGGTAAGAAG AGCATTTCCT 
CTTCAGCAAC CAAGGTTCTC 
AAGACTTCAT GGGAAGAGAG 
CTGTTCTCTT TGGCCTTTAA 
TGATAGGCTT GAGGGTTTGC 
GAGGACTCCT TCCTCCAGAG 
TATGGCCCAA GAGTAGCTCG 
TTTACCAGTT CTACTGGGGT 
CTAGGAAGAG GGTCATTCTT 
CAGCCAAAAC AAAGAACTGG 
ATCCAGGGCA CTCATATGTG 
CAAAAAGAAG TGGGGAAAGA 
ACCAGGAAAA CAGCAGGAGA 
GCTTAGATGG TGGAAAGCAA 
AATGAGAAAA GGAGACAGCT 
AGTTGCTTCC TTGAAGATGG 
TCACACTGGG GCTAGGGAAG 
AGCCAGGGAC TTGAGAAGTT 
AGTATTTCAG CTTTCCCTGA 
CTCTCCTAGC CCCCTCAGGA 
GGGTTCTGGA AGGGAAAACA 
CTATCCACAA GGGCCATGAC 
TGGGATTTTA GGGTGCAGCT 
GCATGTCTTG AGGCCCAGAC 
CGTCACCACC TCTGCACCTG 
CGCCACCTGC TCCTGAGCCT 
TGTACTGTGT CTCGGGTCCA 
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3151 TATATGAATT GTGAGCAGGG TTCATCTATT TTAAACACAG ATGTTTACAA 
3201 AATAAAGATT ATTTCAAACC ACCAAAAAAA AAAAAAAAAA AAAAAAAAAA 
3251 AAAAAAA (SEQ ID NO:l) 

FEATURES : 

5'UTR: 1-44 

Start Codon: 4 5 

Stop Codon: 681 

3'UTR: 684 



Homologous proteins: 

Top 10 BLAST Hits 



*3 



CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



|33500 
|18000 
|18000 
|18000 
|33500 
|18000 
| 18000 
|33500 
j 18000 
|18000 



1101587 
0049373 
0051870 
0049296 
1098683 
0050961 
0049528 
1098688 
0049453 
0051630 



561 /WO20005 

98 /altid=gi 
45 /altid=gi 
18 /altid=gi 
352 /altid=g 
41 /altid=gi 
69 /altid=gi 
905 /altid=g 
80 /altid=gi 

99 /altid=gi 



84 73 /org=Homo sapiens /taxon=9... 
|464561 /def=sp|P35289|RB15_RAT RAS . 
| 7498104 /def=pir| |T33855 hypotheti. 
|131798 /def=sp|P24407|RAB8_HUMAN R. 
i | 11422744 /def =ref | XP_001482 . 1 | TR. 
|2317272 /def=dbj |BAA21744.1| (AB00. 
| 131848 /def=sp| P22128 |RAB8_DISOM R. 
i | 11432830 /def=ref |XP_007682.l| RA. 
| 131847 /def=sp|P22127|RA01_DISOM R . 
| 7705849 /def=ref |NP_057215.l| ras- . 



BLAST dbEST hits: 



gi | 12333507 /dataset=dbest /taxon=96... 
gi | 12120217 /dataset=dbest /taxon=96... 



Score 


E 


428 


e-119 


423 


e-117 


220 


6e-56 


216 


le-54 


214 


4e-54 


214 


4e-54 


212 


3e-53 


211 


4e-53 


211 


5e-53 


210 


6e-53 


Score 


E 


626 


e-177 


377 


e-102 



5 3 

Id 



EXPRESSION INFOR14ATION FOR MODULATORY USE: 

library source: 

From BLAST dbEST hits: 

gi | 12333507 brain 

gi | 12120217 epid_tumor 

From tissue screening panels: 
Fetal whole brain 
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1 MAKQYDVLFR LLLIGDSGVG KTCLLCRFTD NEFHSSHIST IGVDFKMKTI 
51 EVDGIKVRIQ IWDTAGQERY QTITKQYYRR AQGIFLVYDI SSERSYQHIM 
101 KWVSDVDEYA PEGVQKILIG NKADEEQKRQ VGREQGQQLA KEYGMDFYET 
151 SACTNLNIKE SFTRLTELVL QAHRKELEGL RMRASNELAL AELEEEEGKP 
201 EGPANSSKTC WC (SEQ ID NO : 2 ) 



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

205-208 NSSK 

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 2 

1 92-94 SER 

2 206-208 SSK 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches : 2 

1 29-32 TDNE 

2 104-107 SDVD 

[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

101-109 KWVSDVDEY 

[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 2 

1 18-23 GVGKTC 

2 136-141 GQQLAK 

[6] PDOC00017 PS00017 ATP_GTP_A 
ATP/GTP- binding site motif A (P-loop) 

15-22 GDSGVGKT 

[7] PDOC00579 PS00675 SIGMA54_INTERACT_1 

Sigma- 54 interaction domain ATP-binding region A signature 
11-24 LLLIGDSGVGKTCL 
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BLAST Alignment to Top Hit: 

>CRA| 18000004937398 /altid=gi | 464561 /def =sp | P35289 | RB15_RAT 

RAS - RELATED PROTEIN RAB-15 /dataset=nraa /length=212 
Length = 212 

Score = 423 bits (1077), Expect = e-117 

Identities = 207/212 (97%) , Positives = 209/212 (97%) 

Frame = +3 

Query: 45 MAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHSSHISTIGVDFKMKTIEVDGIKVRIQ 224 

MAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHSSHISTIGVDFKMKTIEVDGIKVRIQ 
Sbjct: 1 MAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHSSHISTIGVDFKMKTIEVDGIKVRIQ 60 

Query: 225 IWDTAGQERYQTITKQYYRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIG 404 

IWDTAGQERYQTITKQYYRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIG 
Sbjct: 61 IWDTAGQERYQTITKQYYRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIG 120 

Query: 405 NKADEEQKRQVGREQGQQLAKEYGMDFYETSACTNLNIKESFTRLTELVLQAHRKELEGL 584 

NKADEEQKRQVGREQGQQLAKE YGMDFYETSACTNLNI KES FTRLTELVLQAHRKEL+GL 
Sbjct: 121 NKADEEQKRQVGREQGQQLAKEYGMDFYETSACTNLNIKESFTRLTELVLQAHRKELDGL 180 

Query: 585 RMRASNELALAELEEEEGKPEGPANSSKTCWC 680 

R ASNELALAELEE+EGK EGPANSSKTCWC 
Sbjct: 181 RTCASNELALAELEEDEGKTEGPANSSKTCWC 212 (SEQ ID NO:4) 

>CRA| 335001101587561 /dataset=GENESEQ /org=Homo sapiens /taxon=9606 
/mol_type=protein /date=08-FEB-01 /length=218 
/altid=derwent_id|B41604 /altid=derwent_ac | B41604 
/def=Human ORFX ORF1368 polypeptide sequence SEQ ID 
NO: 2736 /patent=WO200058473 -A2 /pat_sect ion=Claim 
Length =218 

Score = 428 bits (1088), Expect = e-119 

Identities = 212/218 (97%), Positives = 212/218 (97%), Gaps = 6/218 (2%) 
Frame = +3 

Query: 45 MAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHSSHISTIGVDFKMKTIEVDGIKVRIQ 224 

MAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHSSHISTIGVDFKMKTIEVDGIKVRIQ 
Sbjct: 1 MAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHSSHISTIGVDFKMKTIEVDGIKVRIQ 60 

Query: 225 IWDTAGQERYQTITKQYYRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIG 404 

IWDTAGQERYQTITKQYYRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIG 
Sbjct: 61 IWDTAGQERYQTITKQYYRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIG 120 

Query: 405 NKADE EQ KRQVGREQGQ QLAKEYGMDFYETSACTNLNIKESFTRLTELVLQAHR 566 

NKADEEQKRQVGREQGQ QLAKE YGMDFYETSACTNLNI KES FTRLTELVLQAHR 

Sbjct: 121 NKADEEQKRQVGREQGQQKCPSLQLAKE YGMDFYETSACTNLNI KES FTRLTELVLQAHR 180 

Query: 567 KELEGLRMRASNELALAELEEEEGKPEGPANSSKTCWC 680 

KELEGLRMRASNELALAELEEEEGKPEGPANSSKTCWC 
Sbjct: 181 KELEGLRMRASNELALAELEEEEGKP EGPANSSKTCWC 218 (SEQ ID NO: 5) 
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Hmmer search results (Pf am) : 

Model Description 



Score 



E-value N 



PF00071 Ras family 

CE00060 CE00060 rab_ras_like 

PF00006 ATP synthase alpha/beta family 



323 .8 
211.0 
4.2 



8.2e-95 
1.8e-59 
2.1 



Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00006 1/1 10 24 . . 203 217 .. 4.2 2.1 

CE00060 1/1 2 165 .. 16 184 .. 211.0 1.8e-59 

PF00071 1/1 10 212 .] 1 198 [] 323.8 8.2e-95 
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ACCCGGTGGC 




2951 


TGGGTCCATG 




3001 


TGAGGGGCCC 




3051 


CTGCTACCCG 




3101 


GAAACACCTG 



GCACAGCTGG TGAGTGGTGA 
TCCAGAGCCC TTGGCATGCA 
ACGCGCACCT CGAAATAATG 
TGTGTAATCA GAGTAGCTAG 
TTTTTTTTTT TTTTTGAGAC 
TGCAATGGCG CAATCTCGGC 
CAATTCTCCT GCCTCAGCCT 
ACCACGCCCA GCTAATTTTT 
ATGTTGGCCA GGCTGGTCTT 
ACTTAACCTG AAATATACCT 
TCCTTGACCT ACTATTGAAA 
AAACAGGTAT GTACATATAT 
GAGACAAGGT CTCCCTCTGT 
CGGCTCACTG CAGCCTTCCC 
GCCTCTCAAG CCTGGGATTA 
TTTTATTTTT AGTAGAGACT 
CAGCTCCTGA CCTCAAAGTG 
GGATTACAGG CATGAGCCAA 
CCTAATAGGC TAGACTTTGC 
CAATTTTCTC AACTCCGGAG 
ACATTCAGTG GTAGGAAACT 
GTGCCACTGG CTATACCACA 
GGTCACCTGG GCACCTATCC 
CCCGCACACC TCTAAAGGAT 
TATCTTTTCA CCCTGGTGGA 
GAAAAAATGG GGACGGGGGA 
CAGCCAGATG GCAGGGAGGG 
GGAGACGGGT GTTCGAATCC 
TGAGCCCAGC TGAGGTACCA 
ATGCAGGGTG CCACGGCGGG 
ATATGGGGAG AAGTGGTGAG 
TGGGTGGGGG CTCACCTCAA 
GAAAACGTTC CCCCGCCCCG 
GCCCCGCCCC AGCGTCGCCT 
CTCCCCGCCC CCTCTCTACC 
CGGGCAGGGG CCGCCGCCGG 
GCCGAGCCGA GCCGAGCCGA 
CGCGCTTCCC GGCACCCAGC 
GCCGCGGGGG CCGGGCCCGG 
GCGGGCGCGG GCCGCGATGG 
CGCCCGCAGT TCCCGGCCCC 
TGTGCTGTTC CGGCTGCTGC 
GCCTGCTGTG CCGCTTCACC 
ACCATCGGTA AGGGGCGGTG 
GCCCCTTTCC CCGCCGCCCC 
CTCCGGCTGG AGGCGGTGGC 
GGGTGAGCAT AGGTTTTGGC 
GGACCAGGGG TGGGGTCTCC 
CAAACCGAGA AACCGGGGGT 
GAGCTGGGGT AGCAAGAATC 
CAGGGGGAGG GGAAGTAAAG 
AGACCGGAGT CACTGAGGCG 
CCGGCGGGGG GCGAGGGGGG 
ATCGAGTTTC CAGGGCAGAG 
TGGGGGAGGA CACAGATAGG 
ACATCTGTTC TGAGGGAGGC 
TGCCCAGACC AAGGGGTCAG 
CTCATCTGGC AGTGCTGGAG 
ATCCCAGACA GCACTATGAT 
GCCTCCCCTC TGAGACAGTC 
CCTCCCTGAG AAGTCTGAGT 
GGTCAGCCCT GGAGTAGCCT 
CTGGCTGGCT GGTCATCCAT 



CACTGGGACT GGAACCCAAG 
CCTGAAACCC CATGTAAGCC 
GAATCCACTA CATCAGTTCC 
CAGGCTCGGG ATTTCGCCCC 
AGAGTTTTGC TCTTGTTGCC 
TCACCGCAAC CTTCGCCTCT 
CCCGAGTAGC TGGGATTACA 
TTATATTTTT AGTAGAGATG 
GAACTTTTCC CCTCTTATTA 
TTTCAAATGA AGTAAATGGG 
AATACATTCT CCATCCAATA 
ACTTTTCATT GCTTTTTTTT 
TGCGCAGGCT GGAGTGCAGT 
CTAATGGGTT CAAGCAATCC 
CAGGCGAGCC ACCGTGCCCA 
GGGTTTCACT ACATTGGCCA 
ATCTGCCCGC CTCAGCCTCC 
CGCGCCTAGC CTTTCATTGC 
TCTCCCTCAA TACTCGTTGG 
AGCATTCATT TGCCTCTCTC 
GGATCTTGAA CAAGGGCCAT 
GAGAAATTTA GGGGTCTGAA 
TAAGCACCTT AGAGGGAAAA 
TTTGCACTTG GAGATGTTCT 
GGCCGTGAAT AGGCATTTTC 
GGGCCGTGAC ACAGTCACAC 
GGAATTCCAC CCACACTCTC 
AGATCCTGCT CAAGGCCTTC 
GCCACTGGGG AGCCCGGCCA 
CGGAATTACC GGCGCCAGAC 
CCCGGAAAGC GGAGCACGGT 
CTCCCCCATT CGGAGCGTCC 
GGCAGGAAGG GGTTGGGAGG 
GCTCGATGGG GTCCCGCTCT 
GGGGCGGCGG CGGCGGCGCA 
TTTCTCCTCC CACCGCCTCG 
GCGGGCGCCG CGCCGGGCTC 
GAGCGAGTGG GCAGGCGGGC 
CGTCCTCCTC GCCGCCCGCA 
CAGCGGCGGA GCAGGGCTGA 
GCTGGCCCCA GTCATGGCGA 
TGATCGGGGA CTCCGGGGTG 
GACAACGAGT TCCACTCCTC 
GCCCGGGGCG CCCCTCCCTC 
CGTCCCCAGC TGGGGAGGAA 
GCCGGAGGCC GGAGTCCGGG 
AGGTGAGGGT GTCCCTGCTG 
CGCCTCTTGC CGGGAAGCCT 
GAGGGGAGCT GGTGTAGGCC 
GTAGCCGCTG GAATAACACC 
CTTCTGCTAC GGAAAAGGGG 
CCCTTGGTTC TGTGGTGACC 
GAAGAGAGGA CGTACGGAGG 
TTGGGAAGGT AAGCCGCAAG 
GTGAGGAGCC CCTGCGCCTG 
TAAAGAGGAT GGAGGAGTGT 
AAGGCAGGCA GGACCCGCCT 
CCTGTGGTTA GAGGGACAAG 
GGGGTCACTT ATTCTAGGAA 
AGTCTCCCGC TTCTAGGCTG 
AGAGGGAATT TCATCCTCAG 
CTGCATTGCC CAAGCCCCTG 
TTGGAATGCT CTCCTAGAAG 
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AT AAC CT AGA 




C C ft T 

5501 


TAGTGCCAAA 




5551 


pmii tv tv tv rp/~t 

GTAAAGGATG 
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5951 
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6001 


CTAGTACAGG 




6051 


ATGCAGGGAC 




6101 


AACATGAGCC 




6151 


GGTTGGCTGG 




6201 


GCTGCAGTAG 




6251 


CCAGTGGTAC 



CATCAGGGAT GGGCACCAGC 
CACCATCCCC CCTCCCCCCA 
AGCAGATAGA TACAGCCAGG 
CTTTATTGTC CAAGAGACCT 
AATATTGCCT AAACTTCTGA 
GTATAATAAT TGCACCTACC 
AACGATTGTT AAGTGCTTAG 
ATGCTGATTG TTCTTACTGC 
TCCTCTCTTT GTTTCACCAA 
CAGGGAACAG AACTTGTCCG 
TTAGGAAACC CAGGAGCTTG 
GCAGAAAGAA GGGGAGCCTA 
GACTTGGGGA AGAGGCAGGT 
AGTGGCAGAG GAACCAGATG 
CAAAGGGCGG TGGGTTATCC 
GAGAGGCCTG GTGCTTGGAA 
TGCCTGGGAG GCCCTGGCTT 
CTCCTAGTCC CATTCCAAGT 
GGGCTTCATT CAACCACCTC 
GCTGTGGGAG GCAGCCTCCC 
CACCCCTCTT TCTAAAATAC 
TTAAAGAGAG TCCATTGCCT 
TCCCTTCCCC TTGTAATCTC 
ACACTAAATA GGTCACAGGG 
TTGGGCACAT AGGTGAATCA 
TGTCTCCCCT CAAGGCCCTT 
CCCAGATTTC TTCCTCCGCT 
CACAGAGGAT TCTCTAGGGT 
GGAGTTTGGA TTCCATGGAA 
CCTTTTAAAA TCTGCCCCAC 
CTTATTAAGG GACTTGGGAT 
TTAACTAAAA CTTAAAGGAA 
TTAAGTGATA TGGACAGTTT 
CAAATCAAGG AGCCAACGAA 
TATTTTTAAA AAATGCGTGT 
GAGATTCCAA GATGAATAAT 
TACCTAAAAC ACATTACATG 
CCCAGGGGCT TTTTTTGTGA 
TGCCAAGATA GTTTGGCTTC 
ACTTTTGTGA TTTACTGTGT 
AGAAGAACCC GCCAGGGAGC 
CTGACTGTCC CTCCCCCTAT 
AGGTGGCCCT GCCACTGGCT 
GTGCTGGCCT GGCTGGGGGT 
TAAACAAATT AATTGAGCAT 
GAGACATATT GCAGTCTACG 
TGGGGAGGCA AGACAAAGGG 
ATGAGATATC TGAAAGAACT 
GATTATGATC ATTGCAGTTA 
GAAGCTAAAT TATGATGCAA 
ATGGATGGGA TGGGTGAGTG 
TGGGGAAGAG AGAGGCCCCT 
CTAGGCCTCA GTGTGGGCTG 
ATGGAGTAAC ATGAGCAGAG 
TGGGGCTCTA GGCAGTTTAG 
AGAAGATTCT TCTGTTTCTC 
TGGTGCGACC AAATCTTATT 
GCCAGGCAGA TTCAATGCCT 
CAGTTGGCAG AGGGCATTGA 
TTAGCAATGG TAGGAGGGGT 
AAGAAGGGGT TAAGACTTGA 
TATCCTGGGC ATGAAGGAAC 
CAACACCAAC TTCCACACAG 



TCTCAGCTTC CTCTTGAGGA 
ACACACATTC CTTGCTGAGA 
AGGAACAGAA CCTTCTGGTT 
GAGACCTCAC TGTGGGGCAA 
GCTTTATTTA GTTTCTCATC 
TGCCAAGTTG TTGTCAAGAT 
CACAGCCAGA CACATGGTGA 
TATTGCCATT ATCATTGAGC 
CTTGAAGGGT GAAACAACAG 
TCTTTCTCAG AGGAGCTGTA 
GGCTGAGCCA TGGTTTGGAT 
TAGACACTAA GGCTTTGTGC 
GCAGGAGAAA GGCATGGGCG 
GTGTATGAGG ACAGGTTGTG 
CTTAAGGAAA CTAGGAGTGG 
CTACTGAGCT ATCTCCAGAG 
TGGCCTGAAG AGCTGTTGTT 
CCTATAGGTG ACATGGACTT 
ATTTCAGAAG CTCTGGGACT 
CTGGGAGACA CATACCCTCC 
CAGGATGGCC CTCTGAGGCT 
CACACCTCTA ATCATCCACC 
CCTTCTTAGA CACCTTCTGC 
ACTTCCTGAA ACCCTCCAGG 
GTGAACTGAG TAGGGGTGTC 
GGTATATTGG CCTAAAACCT 
TCTGACACCC GGGTTCCCCT 
CCCCCTTTCC ACAGGACACA 
TAGAAAGAAA CCTGTCTTTC 
TGGGTATCTT AAATGCTTTC 
TCTCCCTTAT CTTGGGCGTG 
AGAGTTGGAT GGTCAAGAAA 
CTCAAGGAGG TAGAAGGGGC 
ATGAGTGCTA CCAAGTCATA 
CCTGTATGCC AGGCTCTGCA 
ACCTACAGTC ACTGTTCTCA 
ACCATGCTGG CCACTGATCG 
ATTAAGAAAA CAAGGTAATT 
TTGGCTCATG TGGATATCAC 
ACTCCACTTT AACGGCCTGC 
AGTGAGAGGC CTCCCTGGTA 
CCTTTTCGTC TTTCTGGCCA 
CTGCCACAGG CATTTCCTTT 
TGGTGCGAAG GGGTCCCCAA 
TCACTACCAA GCTCTGTGCC 
TTTTCTGCCC ACAGAAGCCC 
AAAAACAAAA AACAAAGAGC 
TGGTGAATCA CTCTTCAAAT 
CTCTTAATGA AGGTCTCACA 
GATGTACCAT GAGGCAGCCG 
CTATTCCCAC GACTCCATGC 
GTGGACTAGA ACCGGCAGGG 
CTCGTCAGTT CCTGCAGCAG 
ATAACAGAGG TGGGATTGAG 
ATGCAACAGA AGGGATTCTT 
TCTCTGTCTC CCAAATTATA 
CCTCATTGTT TTTATAGTCC 
GTTGTTAAAT TAATGAATGA 
GAGCCTGGCC AAGGAGGTGG 
TTTGAGTAGG GTACTAATGA 
AGCAGGGAGA CTAGTCAGGG 
CTCTGAATGG CCCCTCACCC 
TCAGTTGTTC TACTTTCCCT 
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6301 CCAGAATGGG GAGTGGTTCA AGCCAATCAA CCTGGCAACT TCTGAAAGAA 
6351 TCTTATGGGA CCTGTGCCAT GACCAGGTAG GGAGAAGATG TCATACATGG 
6401 ACATCTATGT TCAGGGGACC TTTGAGGACC TTTCTGCATG GTGGCCAGGA 
6451 GTGAGATGAT GTAAACCACA AATGGAAACT GAAGAGACTG CTCAGGAGTT 
6501 GTTGGTTTTC TTTTCTTTTC TATTTTTTTT TTTTTGAGAC TAGGTTTCAC 
6551 TCTGTCACCC AGTCTGGAGT GTGGTGGTGG CACAATCACG GCTCACTGCA 
6601 GCCTCGATCT CCTAAACGCA ATCCTCCCAC CTCAGCCTCT CAAGTAGCTG 
6651 GGACTACAGG TGCATGCCAC CACATTCAGC TAATGTTTGT ACGTTTTGTA 
6701 GAGATGGGGT TTCACTATGT TGACCAGGCT GGTCTCGAAC TCCTGGACTC 
6751 GTGATCCACC AGCCTCAGCC TTCCAAAATG CTGGGATTAT AGGCGTGAGC 
6801 TACCTCACTC CCTCAGGAGT TGGTTTTCTC CCTCCCATCC TTAGTCTTCC 
6851 CTGAGTAGAC CTGTCACCTA GTCCCTGGAC CTTTTGTTTT GAAAGCCACC 
6901 CTCCAGGCTA CACTCCTTCT GGGTGAGGAG GAGGGTGATC TGGTTGGACA 
6951 GGTTGGGCTG CTGTGGCTTC AGGGCACTTT CTCAGGCTGG GTTGCTGCTG 
7001 CTATGTCACC TTTCTCAAGG AGTTCTGCTG GGACTGGCTT GGCTGCCTGT 
7051 CTTGACTTTG CTTTTGACTG AGGAGGTGGG AGATGGTGAG GGAGGGGGTG 
7101 GGGCTAGATC CAAGCCTGGA ATGGGGTGAC CTAACAGACA CTGGGGCCTG 
7151 TGCTTAGACA CTAGGATCCT GGGGTTTGCA GGTTTCTAGA CTGAGAGGAG 
7201 CTGGGGGCAA ATGCAGTGTG ACGTTGTGAG AGGGTCAGGG CTGGGTCTGT 
7251 GTCAGCCTTC AGGCAGCCTG AGACCAGTCT CTACCTACTC TGTTCCCCTG 

73 01 GTACCTAGAA AGGAAGGGAA GGTGAGAAGC AATGAGCAGA ATGGAAAGAG 
7351 CCCAGATTAA CATGCACATT TCCCATGGCC TTACTGGCCC TGTGACCTTC 

74 01 AGACACTTTG ATGACATCTT TGTGCTTCGT TTCTGCATCT GTAAATTGAA 
7451 GATGGTAACA GAGTCTTTCT TAAAGGTTGT TGTGAAGATT ATAGAGCCTA 
7501 GCGCATATAA AGCACTTGGC AGAGCCCTCG ATAAAATAAT AGCTGCTATC 
7551 ATATTATCAT TATTATTATT TTATTTATTT ATTTATTTAT TTTTTTTTGA 

• % 3 7601 GACCGAGTAT CTCTCTGTCG CCCAGGCTGG AGTGCAGTGG CACAATCTCG 

fg 7651 GCTCACTGCA ACCTCCATCT CCCGGGTTTA AGTGATTCTC CTGCCTCAGC 

I* 7701 CTCCTAAGTA GCTGGGATTA CAGGCACCCA CCACCACACC CGGCTATTAT 

f * 7751 TATTATTCCT AGCTATAAGA ATGCTGTAGA GATGAATACA CTGTCAGTGA 

N 7801 GCTAGGAGGT CATCCTGTGT ATCCATCACT TGTGCACTCA GTCGTTCAGG 

7851 CGCTATTTGC TGAACACCAA CTACATGCCA GGTGCCATGC TAAGATTTGG 
7901 GGACACAGTG GTGACCAAAA CAGACAGAAA CCAAGGAGCT GGCTTACATT 
7951 CCAAGGGAGT GCATAGGAAG CTGTGTTTCA TTTCAGTTTC TGCTCTAGTA 
8001 CCCCCCTTTC CCTGGCAGTG CCAGGGTCTG AGAAGGAAGA GTGAGGTGGT 
s _ 8051 GAGGAGGTGT GAAGCAGTGG GGTGACCTGA GAGGAGAGGA TGGGGTGGCT 

£3 8101 TTGCCTCAAG GCTTGGGCCC CTGCTAGGTG TCGCTCTGCC TCAGGCCTCT 

hi 8151 GTTTCTCCTC CTGACACAGG CACAGACTCG GCCTCCCACC CCTTCCCCAA 

lil 8201 GGACATGACC TTGGGAAGGA ACATATCTGA AGCCCGCGGA GGGTTTCCGC 

8251 TGCTGTGCAT CTGTGCCACA GATCCGCAGA TGCACCCACA GCTGGGAGCA 
*J 8301 CCGGTTCCTC CCGCCTACCT GCACTCCCTG GTTTCTGTTC CTTCCTCCTC 

P 8351 CTCCTTCCTT CTCCCCGCTC CCCAGACAGG CTGGTGATGA GCTTTATAAC 

8401 ATGAAAGCTG ATATTTGGCC ATTATCCTTC TACCCTGATT GCCAGCTCTT 
8451 CTCAGAGTGC CTTCTTCTGT AATCCAATCT TTGCACCAGT TTCCCTGTGA 
8501 AACTGCCAGT TTTCTGTATA GGCCTCTGCC CTCTCCTTGG CTCTTCTCTC 
8551 TGGTCAGTGA GCTTTGTCAA GGGGAACACA GGGCTTCCTG GACACGTAAT 
8601 TCCTCCCACT GAGGAGGAAG GGGCTAATCA CCAGCCCTGT TTTATTTTAT 
8651 TTTATTTTTT TGAGATGAAG TCTAGCTCTG TCGCCCAGGC TGGAGTGCAA 
8701 ATGGCTCGAT CTCGGCTCAC TGCAACTTCT GTCTCCCGGG TTCAAGCGAT 
8751 TCTTCTGCCT CAGCCTCCTG AGTAGCTGGG GATTACAAGC ATGCACCACC 
8801 ACACCTGGCT AATTTTTTGT GTTTTTAGTA GAGATGGGGT TTCACCATGT 
8851 TGGCCAGGCT GGTCTCGAAC TTCTGACCTC AGCTGATCCA CCCACCTCGG 
8901 CCTCCCAAAG TGCTGGGATT ACAGGAGTGA GCCACCATGG CTGGCCGACC 
8951 CCATCTCTTA AAAAAACAAA AAGAAAAGAA AAGAAAACAA AACAAAAACA 
9001 CTTTTTAAAT TAACTGATTA TGGTGGCATG TGCCTGTAGT CCTAACTACT 
9051 CAGGAGGCTG AAGTGGAAGG ATTGCTTGAG CCCAAGTAGT TGGAGGCCAC 
9101 AGTGAGCTGT GATCACACCA CTGTACTCCA GCCTGGGTGA CAGAGTGAGA 
9151 CCCTGTCTCA GGAAAAAAAA AAAATTACTG AGAACTCTGT GACCATGGCA 
9201 CCATGAACTA TAGAAAGGGC TAACAGTTGG CTTTGAAATG TGGGTTATGG 
9251 CTGGGTGCGG TGGCTCACGC CTGTAATCCC AGCACTTTGG GAGGCCAAGG 
9301 TGGGCAGATC ACAAGGTCAG GAGTTTGAGA CCAGCCCGGC CAACATAGTG 
9351 AAACCTCATC TCTACTAAAA ATACAAAAAA TTAGCCGGGT GTTGTGGCAG 
9401 GTGCCTGTAA TCCTAGCTAC TCGGGAGGCT GAGGCAGGAG AATTGCTTGA 
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12351 


CACCACACCT 




12401 


TGTTGCCCTG 




12451 


TTGGCTTCCC 




12501 


TTGAGACTGT 




12551 


TGAGCCCTAG 



TGGAGGTTGC CACAAGCTGA 
CAGAGCAAGA CTCCATCTCA 
TTGTTTTCTT TTCTTTCTTT 
GAGTCTTGCT CTGTCACCAG 
TGCCACCTCT GCCTCCCAGG 
AGTAGCTGGG ACTACAGGCA 
TTTTAGTACA GAAGGGGTTT 
CCTGACCTCG TGATCCGCCC 
GGCATGAGCC ACCACGCCCG 
TTCCCTGATT AAAATTTTCT 
GACTTCATTC ATCCATTTAT 
CATTTACTCT ACTTTGTTGA 
CAGTGAGAGT CTGAAGTGAA 
CAGTGTAGCT GGGAACCAGA 
AAATGTCAGG ATGCTTTGGA 
GAGGCTGGTG AAGGCTTTGA 
AGACTAGGCA GGATTCTCCA 
CTCTTCCCTG TAGGAGGTGC 
TCTTCAGCCT TGCTGGGGCC 
GCTGAAGTTC AAGAAGCCCC 
TGAGGGCAGG TGCATCTGGA 
GAGTTTCTAT GTCCTAGTCT 
GCACAGCCCT GCCTGAGAAG 
GGTGGGTCTT GGCTGCCTCT 
AGGGAGGCGA GACCCAGTTC 
TGGCTGGAGC ACACTGCACA 
CAAGGCAGTG GGAAAAGCAA 
AACCCCGTCT CTACTAAAAA 
GCCTGTAGTC CCAGCTAACT 
CACAGGAGGT GGAGGTTGCA 
CCTGGGTGAC AGAGTGAGAC 
AAAAAGTGGG GAAGGGGAAC 
ACTATGTACC TACTACCTAC 
TATTCAGTGT ATAGAGATCT 
GTTGGCCTTG CTGGCATTTG 
CGCCTCCAGA CAAGTGTAAG 
CTAAACCAGA TGCTAGAAGA 
CATGTGAGGG TCCACACAAG 
TTCCCCTTTT GTAACAAAGG 
CTGTGAGTTT GACTGAAATG 
CTACCTGATT CTTTGGATCT 
TCAGCTCAAA TGTTACCTCC 
AAGGTTGCTT GTGCTTGGGG 
GCTTCTTATC ACATATGGCT 
TGTCTTCCCC ACTAGAATTA 
TTCATAGCTT ATCTTCTAGT 
ATTCAATAAC AGTTTGAATG 
TGCATTTTCC TAGAGGACTT 
TATTGGTTTC TTGGGACTGT 
GAGACAGAGT CTCACTGTAT 
TGGCTCACTG CAACTTCCGC 
GCCTCCCGAG TAGCTGGGGA 
TTTTTGTATT TTTAGTAGAG 
CTTGAACTCC TGACCTCAGG 
TGGGATTACA GGCATGAGCC 
TAAGACAGAG TCTTGCACTG 
TCAGCTCATT GCAGCCTCAA 
TACCTCAGTC TTCTGAGTAA 
GGCTAATTTT TAAATTTTTT 
ACTGATCTTG AACTCCTTGG 
AAAGTGCTGG GTTTACAGGT 
TAAGATGATG AGGCTGGAGG 
AGATTCCTTA CTCCAGAGTG 



GATCGCACCA CTGCACTCCA 
AAAACAAAAA TAAAAACAAA 

GCTGGATTGC AGTGGAGGAT 
TTCAAGTGAT TTCCCTGCCT 
CGCACCACCA CGCTGGGCTA 
CACCATGTTG GCCAGGATGG 
ACCTCGGCCT CCCAAAGTGC 
GCCTAAAAGT GGGTTATTTT 
CTTTGCCCAT CTTTTCTCTA 
TCGTCTCACT TGCTCATTCA 
ATAATATTTA GTGATCTACC 
CAGGATGCTG CTTTGCCCTC 
CATCCAAACA AGCAGAATAT 
ATCACAGAGG AGTGAGAAAT 
AGAGGAAGTG ACATTTGAGT 
GGGGCCCTGG GTGTGGGGGA 
TGTGGAGAAC ACCTCCAGTG 
AGCTGGAGTG GCCACACCAT 
TTGCCAGGAG ATTGCTTTGC 
AGCCCCCTTC TTTCTAAGAT 
TTTCTTCCCT GAACCTTTTG 
GAGGCTGGAG GAGTGAGTGG 
GTGGTGCCCG CTGGCCTAAG 
CAGGGGCTGC CAATGGGGAG 
TGTCACCAAG GCTCTAGGGA 
GGGGAAGACC CAGCCTGGTC 
TACAAAAATT AGCTGGGTGT 
TGGGAGCCTG AGGCAGGAGA 
GTGAGCCGAG ATCGTACCAC 
CCTGTCTCAA AAAAAAAAAA 
ACTGATCCTG ATTATCTACT 
ACAGGGACGG TGGGCTTTAC 
CAGCATCACA TAGGAGCAGG 
AGAAGTTTCT TGGTGTATTC 
TGCTATTGAA TGCTGACTAT 
CACGCAGTAA ACAGTACAGA 
ACCTGAGAGA AGGGAGGGGT 
AGAGAGTACT GTTGACCCTC 
TGTCCTGCCA CAGGATCTTT 
CCCTGCTGGC ACCTTCTTGT 
TTTAAAATGT CTTCTCTGGC 
TCTCCTCACT CTCTACTTTA 
CTCTGAAATT AGGTATTCAT 
AGCTCTGATG ACAAGGATCT 
ACCTGGCTTA GTTCCTGGCA 
AATGAATTAA CAAATGAAGG 
CTGTTCTTCC CTGAGGGAAG 
TTTTTGTTTG TTTGTTTTGT 
CCCCCAGGCT GGAGTGCAGT 
CTCCCAGGTT CAAGCGATTC 
TTCCAGGAGC CTGCCACCAC 
ACAAGGTTTC ACCATGTTGG 
TGACCTGCCT GCCTCTGCCT 
ACCACGCCCG GCCTGTTTTT 
TCTCCCAGAC TGGAGTGCAG 
CCTCCTGGCC TCAGGTCCAG 
CTGGGCCCAC TGGTATATAC 
GCAGAGACAT GGTCTCACTA 
GTTCAAGTGA TCCTCACACC 
GTGAGCCACC ATGCCTGGGC 
GAGTGGATGG CCTCACTGCT 
CCCTGGCTGC AGAGGTGGCC 
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ACTCCAGCAA CCTGGCTGAG 
CCCTGAATAA TAGGCAACAT 
CTCTGCATAT ACCAAAGAAG 
ATATCAGCTC CCGGGATCCC 
GGGCTGCACA GGGTGACCTC 
GGAGGTGCAC ATGGCCCACA 
GGGCACTCCT GGGACCTGTG 
GATTGGGTGT CTGCCTTGGG 
GGAGTCTCAT TACTAATGAA 
CTGTGGTTGA TCCTGGTGAT 
CCACTGAAGC ATGCGGAGCT 
CCTTAACCCA TGAGGTGCTG 
AGAATTGAGG TTGTAGCCAG 
GCGGCATTTC CTCTTTTTGT 
CTTCATCTCT CTGTCCCACT 
CTACATCCTA GAGCTCTTTC 
AATCTCATGC TTCCCTAAGG 
ATAAAGTTCT GCGTGCCAGA 
ACAATCTTAA AAAAAAAATT 
CAGGTGAGGA AACTGAGGCA 
TGCAGGTAAT TAGTGATAAA 
ATTGTGCTCT TAATGGCCAG 
CATGTTTTCT AGGGCTTTCT 
TCTTTAGTGT GGATGTGCCC 
CCTCCACTGC TGTACTGGGG 
CAAGGCGAGC TAGCTTAGGA 
GCATCCCTCT ATCCTGCCCC 
TGTCTTCCCA GTTCTCCAAC 
AGTTGTGGCC TGGCAGCTGC 
GCAGGAAGAG GATGGGGCCC 
GCACCATCAG TTGATTGAGG 
GTGGCAGGAC ACGTCTCCCT 
TATGCTCTCT GGGGTGAGGG 
TTTTTTTTTT TTTTTTTGAG 
GTGCAGCCTC AACCTCCCAG 
GAGTAGCTGG GACCACAGGT 
TTTGTTTTTT GTTTTCTAGA 
TCTCGAACTC CTGGGCGCAA 
TTTGGATTAC AGGTGTGAGC 
ATTTGTTTTG GAATTGCTTC 
GGCAGAGATT TTAAAAACGA 
AAATGTATTC TCTGCCCCTT 
ACAAAAATAG GGGGCAGTAA 
CCTTCCAGCC CCCTTGTTGA 
TTAGGCAGCA GTGGCCAATC 
GGGGCATGTG CTCTATGGAT 
GACCCATGGC TGTGCTAGGA 
CATGTTCATC ATCCTTAGGC 
CTCATGCCCT GCTCATTACC 
GTGAGCTGGT TTGATTGGCT 
GTCACCCTGT GGGTCACCAG 
CTTCTCCACA AAGAACATGT 
TTTGGGGCTG AGCTGTTCTA 
CCATGATGGG CATATCTAAT 
TGCTTTGGGC ACCTGGTTGG 
CTGGGGCTCT CAGCGTCTCC 
GTTTGCTTGT TTCTTAACAC 
AAAGGAAGTA GACTGTTTTG 
CTTACTGGAA TCATGAACCA 
GTGGGTGTCT TAGTTAGGGC 
GTAATTTAAA GCAAGAGAAA 
GTCCAAGATC AGGGTGCCAG 
CTGTGACAAA GGTGGCACCT 



CTGATGGGCA TCATCTGATA 
GGACCTTAGT CTAAAGCACT 
ATGAGATTTG GGTGAGGACA 
TGTGTGAATG GGGTCTTTTT 
TTTAGAGGTG ACCTCCTGCC 
CATGCTGGTT TCCTGCAGTG 
CTTGGTAACT GGAGCTGGCC 
TTTCAGGTGT ATTAGGTTGT 
AAGTTCAGGT CGCACTGCTG 
AACATTTGGC ACCCAGAGGC 
TGGCTGGCAG GCAGGCAAGC 
GCCCGCTAGT AGGCACACCC 
ACTCCAGGAG CCATCTGGGC 
TGAAACATTC CAGCCAAGTG 
CTCCTTCCTC TCCCCAACAT 
CATTCCCCCT CCTGCAGCTC 
GAGCTTGACG GCTGCTTCTG 
CCCTGTGTTA TGGGTTTTAC 
CTCTGAGAAT CCTCTGTCAC 
AAGATAGGCT AACTGGCTTC 
GGCAGGGTTG GAACCAAACT 
GACACTCTGT GTCTTGAGCC 
AGGGAGGCAG ACAGTGATGG 
TGCCTGCTCC TTTCTGTAAG 
AGGCACCAAG TTTTTCCCTG 
GTCACGTGAG TGCTGGGTGT 
TGCCCCCGGT GCCCAGAGGA 
AGCAGCGCTG TCCCAGCACC 
TGGGGCAGAC ACCATACAGA 
AGGGCTGCTG CCTCAGGCCA 
AGCTTTTCTT GCCAATGTCT 
GCTCTTAAGC CTCAGGCATG 
GGAGATCCCC CTCATGGAAT 
ACAGGGTCCT GCTCTGTCAC 
ACTCAAGTGA TCCTCCTGCC 
GGACACCATC ACACCTGGGT 
GATGGGGTCT CACTTTCTTG 
GCAGTCCTCC CACCTCGTCT 
CACTGTGCTT GGCCTTTTTA 
TTTATGCCTG GCACTATGCT 
GCAAACAAAA CAAATGCTTT 
AGGTTCTTTT TGTGTAGTTG 
GGAGCAGGGA GCGATGGTTT 
AGCATCTGGC TCACTAGCTT 
CTGAGGCACT CTCAGGTGTC 
ACCCATGGGC TGAACTTGGA 
TCCACCGTCC CCAGCCCCAA 
CTCTCCGCTT CTTTCTGCAT 
AACTGGTCAG TCCCCACTGC 
TAGCTAAGCT CCCTTGCCTC 
CAAACCTGTT GATGGTCCAG 
TGCACCCAGC CCTGCTTCTC 
GCCAGAAAGG GACACAGGGT 
GTGCCGGGAA AAACAATGAG 
GAGAGGGCCC ATCTGTCTGA 
GAGAACCTCT GCCAGAGCTG 
TTCTGTGCCC TATTTCTTTC 
TAGGGACACT GTCGGGGTGA 
TGCCAAGGAG GAAGGAGAAA 
TGGCTGCTGT AACAAAATGC 
TGTATTGCTC AGAGTTTGGG 
CAGATTCAGT GTCTGGTGAA 
TCTAGCTCCA TCCTCACATG 
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GCAGAAGAGG 
CCATGCATGA 
ACCTCCCACC 
GAGGAACACA 
CCTGCCTCAC 
TAAAAATACA 
GCTACTTGGG 
GTTGCAGTGA 
GAAGACTCCG 
GCGGGAAAAG 
ATGCCAGGGT 
AAGGTGGTAA 
TGAAGCATAC 
AGCTGTGGGA 
CCTGTGGCTC 
AAGAGCTGAG 
ATGCACCTGG 
GCTGGGCGAG 
GATGCTCTAG 
ACTCTGTAAT 
CCCCAGGAAG 
GCTGTCAGTG 
GGAGGGAAGC 
GAGGCTCATG 
CGCTGGAGCC 
ATCTCTACAA 
AGTCCCAGCT 
GGAGGATCGA 
TGGCAACAGA 
GAGAAGTTGA 
TTGGACCAGG 
TATATAGCTC 
AAAATACAGA 
GGCACGTCAA 
TTCTAAAGCC 
GTGCTCTCGC 
GTTCTCAAAC 
TGATTGCGGG 
AGGCCCAAGA 

GCAGTGGCAT 
GATTCTCCCC 
CCCGCCACCA 
TCACCATGTT 
TCACCTCAGC 
CGGCCAGAAT 
GTCCAGGGAC 
AGTAGCGTCA 
CTACACCTAC 
CTCATGCACC 
GAAACCACTG 
TACTAAAGAT 
GGGCTGATGA 
GGGAAGGCAG 
AGGAAAGGCA 
AGAGCACAGG 
GGGACAGTGA 
TGCGAGGGCT 
AGGCCAGTGT 
ACAGCCACGC 
GCCAGGGCTT 
GAACAGAGCG 
ACCTTTTTTA 



• 

GAACAAGCTC 
GGGCTCTAAC 
AGCACTGCAC 
GACCTTCAGA 
TTCTAGATGC 
AAAAATTAGC 
AGGCTGAGGC 
CCTGAGATCG 
TCAAGAAAAA 
GCATCGGGTA 
GTGTGGGAAA 
GATTCCTCAG 
CCTGGGTGGG 
TGTGGGAACA 
ACAATAATAC 
GGATTCCATA 
GTGTGGTTCT 
TCCTGAGAAT 
TCCCTTGGCT 
GCCTTCCACT 
CCTTTATTCA 
CTTCACATAC 
AGTATTCTCC 
CCTATAATCC 
CAGGAGTTCA 
AAAAAAAAAA 
ACCTACCCAG 
GGCTGCAGTG 
GTGAGACTCC 
GACTTGCCTG 
TCTTTCTGAC 
TTGAAACTAC 
TTTCTGGGCC 
GGTCATTGTT 
GAAGCTCTGC 
TCTCTTGACA 
TTACGCATGT 
ACCCCATACC 
ATTTGCATAT 
TTTTTGAGAT 
GATCTCAGCT 
ACACCCCAGA 
TGACTAGCTA 
GGCCAGGCTG 
CTCCCAAGGT 
TTGCATTTCT 
ACACTTTGAG 
TCTTCACCTG 
CAGATCAGAA 
TGGAAGTTTG 
GAGGCAGAGA 
GCAACAAAAA 
CCGAAAAGAG 
AGAGAACTTA 
TTCATCTGGG 
GGTGGGGAGG 
TAGGTGAACA 
GGCCAAGGTG 
GTTGACTTTG 
AGGGAGAGGG 
GACTGGTAGG 
CAGAAGCAAT 
AATTTAAGAG 



CCTCAGACCT 
ATCACGACTG 
TGGGGATTAA 
CCACAGCAGC 
CGCATAATGT 
TGGGTGTGGT 
AGGAGAATCG 
TGCCACTGCA 
AGAGAAAAGG 
TGCCAGGGCG 
AGGCATCGGG 
CCTCCCAGGG 
TTTTAAGACA 
TGAGTCAGTG 
CTACTCCTGC 
GCTCAGGGGT 
GAAACTTTCT 
GTGCATTTCT 
GGCAAGGGTA 
TCAGGGTTCT 
GCAGAGGGAA 
ATCGTTTAAT 
TTCTACACTT 
CAGCACTGTG 
AGACTAGTCT 
AATTAGCTGG 
AGGCTGAGCT 
AGCTATGATT 
ATCCCAAAAA 
AAGTTACACA 
TCCAGGCCAA 
TTCTATCTAA 
TCACCCTCAA 
TTTGTCTTTG 
AAAGCACACC 
CTTCAGGTGC 
GTGAGAATCA 
TAGAGGGCCT 
TTGCATTTCG 
GAAGTCTCAC 
CACTGCAGCC 
CCCGCTCCTG 
ACGTTTGTAT 
GTCTCAAACT 
CTTGGGATTA 
AACAAGTCCC 
AACAGCTTGT 
GGAGCTTCCT 
TCTGCATTTT 
AGAAATATGA 
TAGATGGAGA 
GGGCTGACTC 
GCCAGATGTG 
GGAAAATCTG 
C CAT AAAAC A 
GGGTGCCCTG 
CACATGTGAA 
AGAAAGCATC 
TCCTGCAGTG 
AAGGAGCCAC 
TGGACACAGC 
AGATTCCTCT 
AGGTTGTGTG 



CTTTTCTAAG GGCGTTAGTC 
AGTCACCTCC CAAAGCCCTC 
GTTTCAATAT GGGAATTTTG 
GGGCTTCTCC TCATGTGCCC 
CAGTGAAACC CCGTCTCTAC 
GGCACGTGCC TGTAATCCCA 
CTTGAACCCA GGAGGCAGAG 
CTCCAGCCTG GGCGACAGAG 
CATCAGGTAT GCCAGGGTGT 
TGTGGGAAAA GGCATCGGGT 
TATGCCAGGG CATGTGGGAA 
TTGGGAAGCC TCTGGCCGAG 
CCAGCTTTCC AGTCCAGCTC 
GGAACATGAG AATTGGCTTC 
CTACTTCATG GGACCCGCAT 
ATGCTGTAAA GACAAGCACT 
TGTGCAGAAG AGTGAGTAGG 
CACACACCTC TGATGCTGCT 
CCTGGTTAGT AGGGGCCAGG 
CTGGGCTGGT TTTCCTGACT 
GGTAGGAGTG AGAGGACTAC 
TTATCCCAGC ACAGCCCTTA 
AAGAAAATCG GCCTGGTGCG 
GGAAGCTGAG GCGGGAGGAT 
AGGCAATACA GGGAGACCTC 
GCATGGTGGT GCACACTTGC 
GGGAGGATTG CTTGAGTCCT 
GCTCCACTAC ACTCCATCCC 
AAAAAAAAAA TTGAAGCTAG 
GTAAGTGCCA GAACCAGGAC 
TGGATGTTTC TTCCATGACA 
TATCACCCAC AGTGCTGTTA 
ATTATGATTC AGTAGGTCTA 
TTTTAAGTCA CCCCAGGTGA 
TTGAGAAACA GAGAACTCTT 
AAAACTTTTG TCCTAATGTC 
CTGTGAGAGC TTATTGAAAC 
GATTCTATAG GTCTGAGGTA 
TTTTCTTTTC CTTTCTTTTC 
CCTGTCGCCC AGACTGGAGT 
TCTGCCTCCT GGGTTAAAGC 
AGTAGCTGGG ATTACAGGTG 
TTTTAGTAGA GACGGGGGTT 
CCTGACCTCA GGTGATCCAC 
CTGGTGTGAG CCACCGCGTG 
AGGTGATGCT GATGCTGTGG 
TACTCAGGCG ATATGTGGAC 
GCAGCATCTC AGGCCTTGCC 
AACTCAATCC CCGCGTGATT 
CCTTAGAGGA GCCGGAATGT 
ATATCTCTTC TTCTCACGGA 
TCTGGGTGTG CACCCAGGTG 
GACAGAGGAC TCTTCCCTGA 
AAGAAAGGAG GTGGCTTCAG 
GTGGAGAAGG TATCCTGCTG 
GAGCTGAAGT CTTCAGTGGG 
TAAACAGTTT GCTAAGCAGC 
CGTCTGCAGA GGCCTCAATA 
CTCAGCAGTG GAAAAAACCA 
GATGGGCACG GGTTACTGGG 
TGAAGGCCCA GGTTGTGTGG 
TGAAGATCCT TGGGCTGTTA 
GGCGGGGAGG GAGGAAGGAA 
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18901 AATCCTTCAG AAGACATAGA CTTACTCTGT TTCTTCCATC ATATGTGAAT 
18951 GCATATGAAT AGCCAAAAGG TGAATAAAAC ACATGTTCCC AGGTGGCCAG 
19001 TGAGACCTAG GTTGCAAGAT GGTGGGGTGT GTGTGAGGCC GGGGAGTGCT 
19051 GCGAGCCCCG GAATTCCTCA GCCTTAGTCC CCCGCCACAT AGCTAAGAAG 
19101 TGAGGGAGGA GGTGAGAAGG AGTCACTGCC CAGCCTCACT TCCGGTGGAG 
19151 TACCCTGTCT CCTTGTCAGT TCTGTCTCTG GGGACAGTTG CCTGCTTTCA 
19201 CCTCTCCCTC CATCCCCTCT TCTCTCACAG GGAAAAATTC ACCTTAATAT 
19251 TGGAAGTTCC TCTCCTAGCA AAGTCCTTCT CAGGCACCCA CAGGCAAAAA 
19301 GGAAACTAAG CAGAGTTAGG GCTTCCAGGC CTAGCCAACT ACACGACTCT 
19351 CCTCTTGCTT CCCTAAGAAC CAGCGCAAGG GGCAGCGTGG GTTCCAGCAT 
19401 AGATGGACCT GTGTTGGAAT CTCTGCACGT GCTGTGCTGA CCCTGGCTAG 
19451 CCATTGACCT CTCTGAGCCC TTGTTTCCTT TCCACTAGGC TCTCTGAGGG 
19501 CAGGGGCCAT GTCTTTTTCA CTGCTCTGTC TGCACTGAGC ACTGTGCAGG 
19551 GCACATAGGA AGTTCCCATA AATGTTTGTG GGATAAAGGA AATAAAACCT 
19601 TCTCTCTTCC TGTCCCCCTT GTGATGGCTT TGCACAAGGC ACTGTCCTTG 
19651 GCCAGGTTTG CTAGGCTAGT GTGAGGATAA ACCAGGTATA TTACAAATTG 
19701 GAGAAAATTT CTCGTTCTTC TTGGAAGAAG GTGCTGTATC ATGAAACAAG 
19751 AATGTCTTGA TTCCCTTCTA TGCCAGGTAC TGGGGAGAAA CAGGTGCCTG 
19801 ATAACCGTTG ATCCAGGCAG AAATAAGCAT ACTCCTGCTT CCCAAGGCCT 
19851 GATGCTTCTC TCCTTCCTCC CTTCCTCCCT CCTTCTCTTC ACTCTTTCTC 
19901 TGCACACATG GAAGAATGGC TGCCAGGCAT TGCCCATTTG GAAAAGTACA 
19951 GCTCAATGGA TATGAATCAG CTTGGGCAGG CGAGAAATGA TTCACGTCTG 
20001 ACCAAATCGA TTTAGTTCAG GTTGCCCGTT CTGCATCTTT TTTCCCTTGT 
20051 AATTAAATGA TGATTGGTCT TGATGGTGGG AAGGAAGAGA CAGAATTTAA 
20101 TTTGTTTGCC TTTGTAGAAA GCTGGGGACA GCACAGATAA GGGAAGATGT 
20151 CTCCCATTTG GCAAATAACT GATGCGGAGG TGGAGTGGCA GTGGTGATGG 
20201 GGATGCTGGT GCCTTCAGGC CTTCTGGGCC GGGCAGTGCA GCTGGTGGCA 
20251 GACGGTTCGG AACTCTACCA TGTTCCCATC TGAAAACTGT GGCTGATCAT 
20301 GCCCACTCCT GACCTTGCTC CAGGGAGTAC ACAAAGACGT AAGCTTAATT 
20351 AACCCACCAG ACGTAGCTCT TGAATCCCTG GGCATAGTGC CTGGGTATAG 
204 01 TTAGAGTTGG GGAGAGGCAT GGTCAGCAAA ACAACCTCCC TCATCTCTCT 
20451 GTTGTCACTC AGAGTCAAGC TGGCTGCTGC TGGTGGTGCT GACTTCTCTT 
20501 GCTGCAGATT TCTCCAATAT GTTTCTGCCC TGCACGCATT TGCCAAATCC 
20551 CTTCGGTTTC TTGTGTCTCG TGGCAGCTTA GCTCCTCCAG CCCTTGGATG 
20601 AAGAAGCGTG GGAACTCTTT GCTTCCTTTC CCTCCCGCAG TGACATGCCA 
20651 TGCCATGCCA CTGCCTCTTC ATCTGGTCCT ATGACAGTCA CTCATAAGCA 
20701 CCCGCATGTA CCCGGCCCTG CACTAGCTCA TGACAGCTGC AGTCAATTGG 
20751 GCCAGGTGCT GTATCTCATC CGGCCTCCTC AGCAACCCTC TGAGATACTG 
20801 GTAATGTCCC TGATGAAGAT ATTTACTGAG GCAGAAATGG ACGCTCAGTG 
20851 AAGCAAGGTG CCTGATGTTA TAGCAATGAG CTATGAGTGG CCAGAGGGAG 
20901 GAGATAAGCT CAGGCCTGAC ACCAAAGCCC ATGCTCCTTC TAGTCAACCA 
20951 CAGTGCCTCC TATGGTGAAT GAGTGAGTCA GCAACCAAGA CGCATGAGGC 
21001 CTTCTTTTTG GTGAGCCTTG GCTGGGTGCT GAGGCTTCAG GTACAATCAT 
21051 GGGTTGGAAG AGCCCTCCTC TCTCTCCACA GTCTGGCACT ATGACCCCTT 
21101 CTGGTTATTA ACAAGGCAAA GAGAGAGAGG GAAGAAAGCA GGCAAATAAT 
21151 GTGGGTTGCT ATTCCTAGAG ATTAGAATTT CAGGAAGGAT AAACACAGCG 
21201 TTCTCTCCAG AAGTATAAAT AGGAAGACTT CACACATGAC TAGAACGAGA 
21251 CATGTTTTAA GTCTGTCGAG TAAGGCAGTG ATGAAGTAGA TTTCCCCAGA 
21301 TTCACTCTCC CTCCTCTGGG TCCCCCAGGG CCTTTACTTG TGGCAACTTT 
21351 CAGCTCAGGG AGGGAGGAAA GCCCCTTTCA AAGCTTCAGA TACTTCCTTA 
21401 AGGTCAGTTT CTGCTTAAAG AAGGCCTTTA CATTACTTCA TCCCTTTGCC 
214 51 AAATTAAACT GAAAGGAAAC CTTTCAAGTG TGATTGCCTG GCCCTTTCCT 
21501 GTTCATTTCT CGTGGGTACG CTTTCTAACT TTCTTTCTTT CTTCCTTTCT 
21551 TCAGGTGTTG ACTTTAAGAT GAAGACCATA GAGGTAGACG GCATCAAAGT 
21601 GCGGATACAG ATCTGGTGAG CTGGGGAGGA GGAGGAGGCA GATGTAGGAG 
21651 AAGAGGACTT CTGGCTGCTC CTTAGCTGCC CCTGCCATGT GTAAAATTCC 
21701 TAGGCTTCAC CTGGGATAAC TGGCCACCTC TCTGATGGAT GGAAGCGAAG 
21751 TCTCAGAAGC CCATCTCTTC CTATAAGCCT TAATCTCCAA CCTCTAAGAA 
21801 ACTTTAGGGG ATTGACTACA AGCACCAAAG GGCAGGAATT AGAAGGAACT 
21851 GGCACACTAA CCATTGTGAA TTTATCTCAG GATTAGGCTT TGCCCTTGGG 
21901 CTGTGCCACA CTATGTTAAG ATTGGAAGGA AGGAGGCTAC ACCCCCCATC 
21951 ATTTAGGGCG AGACCCTGAG AGAGTTCCTC AGGATAGCAT GATGAAGTTT 
22001 CCACAGTAGC AGAGGGTGCT GCTGTGGCTC TCTGCCTGAG GTCTTGGAAG 
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22051 CACTGCCTTT GCCAGGGTTT AGAGCTCCCT CTCAATTCCA CAGCAGTATG 
22101 GGCACTGCCT TCAGAGGTCC CATAGGGACT AGGGGTGTAG CAGCATCCCC 
22151 TGCCAACTCC CATCCAACCA AATCTGGCCA CAGTGGCCAG ATTCCAGAGA 
22201 GCTGTCCAAG GCCTGTTCTG GCTGTGGCTT CTGGTTTCTG CCAGGAGGGC 
22251 AGTTGGCAGG AGGGGCCAAG GCCCTGCAGG CCTGGTCAGC ACCAGCACAG 
22301 ATGACCAGGC CTCTGACTGC AGATCCCTGT GGGGATCCAA GCATCCCTGG 
22351 TTTTTCACCC TTTAGCTCCC CAGTTTTTCC TACAAGGGGA CAGCTCTGCT 
22401 CTTCCCCTCC CCGTCTGTTC CCATGGTCCC TGCTCCTCTG AGGGACTGGC 
224 51 TTTCTCCTGC AGGGACACTG CAGGGCAGGA GAGATACCAG ACCATCACAA 
22501 AGCAGTACTA TCGGCGGGCC CAGGTAAGCC ACCACATTGG GGGTTTCAAA 
22551 GTGGGAAGCT GCCACCCACA CTCCCAGCTC TGGGTATTTG AGATGTCTGT 
22601 GCCACGGATC CCCTAAATAC AGTTCGCCTG CTTGGAGGAG CGCAGGGCGT 
22651 CTTTCAGCTG TTCACTGATC ATTTGTCCGT CCATTGTTCA TGGCCCACTC 
22701 ACTGCAGGCA GGCCCCTGCC CTCACCCCTG ACTTCCACCC TCCATCCTGG 
22751 GTCAAAGATC CAGGTCAAAG CATGTGGTGT CTTCCTGCTG TAGAGAGTTC 
22801 TGTGATGGGC CTGGGAGGCG GCAGTGGTGG GGTCTGAGAG AAGAGATATT 
22851 TCTGGATGCT GAGCAGGGAG AATGGGAGAG TGGGACCCAA CCTTTAAGTT 
22901 TCCACGGCCC CTTCTGGCCC CATGACTGCA CTCTCTCTGT GCATATCACA 
22951 TCTCTCTATT TCTCTCTCTC TCAGGGGATA TTTTTGGTCT ATGACATTAG 
23001 CAGCGAGCGC TCTTACCAGC ACATCATGAA GTGGGTCAGT GACGTGGATG 
23051 AGGTAGGAGA TGCCACCTCA CTGCCGGGGT GTGGAGAGGG TGCCTCACCG 
23101 GGGAAGGCAA GGCGAGGGCC AGATGGGAAG GCAAATGCTT CCAGGAAGCT 
23151 TTGCCTTCCA CAGCCCTGGA TGAAGACCTC TGGGTGAGTA AGACATGGGG 
23201 AAGAAACCGA AGCTGCCATG CCCTCACTCT CTATACCCTG CCAGGCCTCC 
23251 ACGGCTGTGT CTTTCCCGGA AATGAATTAG TTCCAAGTCT TCCCTGTGAG 
233 01 CAGCTTCTTT CCTGAAATCT TGGGACCAGG TGGAGTTGCA AGATTGGGAT 
23351 CTAGTCCTGG CTCTGCACAA TAGCTGTGGA GCCTTGGGAA GCCATTTGAA 
23401 TCCTCTGGGT CCCCAGTTCC TGTAGAATGA GGGCTGGACT TACATCCAAT 
23451 GTCCTTTCCA GCTCTGATAC CAGTGGTCTA ACCCAAGGAA GCACCAGTCT 
23501 TAGCCAGAGT GTCTTCTACC CTAAGCTCTC CCCGTGATAC CCTTGAGGTC 
23551 AGCCATGGCA CTTGGGGGAG CCTGGCACCT GCATCCAGTC GGCCCACCCT 
23601 GTCCCTAGGG CTCTGGAATT GGTGGTGGGC TGGAGGCAGT GCAGACTCTG 
23651 TAGGGAAAAT TGGGGGGGCA GGCAGCACTC ACTGGCTGTT CTGCCCATCC 
23701 TTTGTCCCTA GTACGCACCA GAAGGCGTCC AGAAGATCCT TATTGGGAAT 
23751 AAGGCTGATG AGGAGCAGAA ACGGCAGGTG GGAAGAGAGC AAGGGCAGCA 
23801 GGTAAGTGGA GGGAAAAGGC AAGTCCACCC CAGGTCCTCT GCTGGGCCTC 
23851 CAGGGCCAGT CCTGAGCGTG GGGACCTAGG GGTGTGTTCC CCAGTGGCAG 
23901 GTCCTCCCAC ACGTCCCCAG CACCCCAAGG CCCTGGGGGA GTGGCCATCC 
23951 TCGGAAGGCT TGTTGTCTGG GTTTCAGGAC AGAAGCCCAG AGATTCGGGG 
24 001 TCCATCCAGA AACAAAGACG TCATAGGCAG CAACTCTCCC AAGTCCAGGT 
24051 CCCCAAATGC AGGATTGCCC TCTGCTTAAG AGATCATCCC CGTGTTAGTA 
24101 ATGAAGGACT TCAAGTTGTC AACCTCTTCT CTGACAGCAT CCAGGCCTAG 
24151 CTGCCATGTT ACGGTCGAGA AATGATCTCC CATCCCACCC AACACTCCCC 

242 01 CACTCCTGTC CTTCTTACCC AGGAAAGAGC CAGGGAGGCA AATGAGGAGA 
24251 CAAAGAGCCA CAGCTGGAGA AGCCATGGGG GCAGAAAGGG TAGGAGGATG 
24301 ACGCTGAGGG AATGTCCAAG CATGCAGGGA GACCATCCTC CCAGAGAGCA 

243 51 GAAAGAAATA TTGGTTATTT TTTTTTTCTT TCTTTCTTTT XTTTTTTTTT 

244 01 TTTGAGATGG AGTCTCGCTC TGTCACCCAG GCTAGAGTGC AGTGGCGCCA 
244 51 TCTCGGCTCA CTGCAACCTC TGCCTCCTGA GTTCAAGCAA TTCTTCTGCC 
24501 TCAGCCTCCC AAGTAGCTGA GATTACAGGT GCATGCCACC ACGCCTGGCT 
24551 AATTTTTTTG TATTTTTAGT AGAGATGGGG TTTTGCCATA TTGGCCAGGC 
24601 CGGTCTCGAA CTCCTAACCT CAGGTGATCC ACCTGCCTCA GTCTCCCAAA 
24651 GTGCTGGGAT TACAGGCGTG AGCCACTGTG CCCAGCCAAG ATTGGTATTT 
24701 CTGAGATAAG TTATCCACTC AGTCCGTGGA CCTCAAGAGT TTTCCTCTCC 
24751 CTTTTCAGTC AATAGCGTTC CATTAGTACT TAAAATGAAA TTGATTGTTT 
24 801 GGTATAAAAT ATAAGACATG GTCATTGACC AATTTGAAAG TAGAGGCAAA 
24 851 GCCTACTAGG ATAGTATTTA TTGAGCACTC TATGTGTGGC ACTGTGCTAA 
24901 GGCAAGCGCT TTTAAGTGCA CGACCCCACT GAATCATCCC ACAACCATGG 
24951 ATGGGAGACA CACTCAGTCT CCTTTAACAG AAGATAAAGC TGGGGCTTAC 
25001 AGAGAATGTA CAACTTGTCC AAGGTCACAC AGCTAGCCAT CAGTGGCAGT 
25051 GCTGCTATTC AGGTCTGGGA CTGTGGGACT CCAGAGCCCA TGTTTTTTAC 
25101 GAGGATGCCA TACTGCCACA ATGGATGGTG TCTTTATCTC CTGATATATG 
25151 ATTGTGTGTT GGGAGGCGTG GGGTGGCAGC TGGAAGAATG GAGAGGCATA 
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zooUl 
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26851 


AL AGCALt 111 


s 


zo y ui 
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£3 
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T. S 


z / lb 1 


1 LrLr L 1 ALALrVj 


r 


z /ZUl 


LtLt AAVj 1 1 1 L 1 
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h SD 
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*) *7 "3 C 1 
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z / fl U 1 


1 LAiLAAL 1 A 




Z f *i b 1 


PPTP A P 1 T"T'P 1 0 
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z / b U 1 


rp rp rp r~\ fi rp si (-* tTtfi 

1 1 ILLlLLlLr 
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z / b b 1 
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z / 0 U 1 


a APTTPTA A A 
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z / 0 b 1 


pa APPTPTPA 
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z / / u 1 


TaTPaapapp 




z / / b 1 


ncHHCvn anp 
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0*7001 
z / ou 1 


TPPPTPPa aT 




z / obi 
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z / y u 1 
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z / j?b 1 
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28001 


ATGGGAGCAG 




28051 


AAACTTTTGG 




28101 


CTCTGCTGGC 




28151 


CTCCATCCAG 




28201 


CTGTGACGGG 




28251 


TGAGCAGGGT 




28301 


TTTCAAACCA 



ATCTTCCCCC ATTCTCTGCT 
AGAAATTATC TACTCTGAGA 
CAGTGGCAGA AGCCTGTGCC 
GACTCTCTTT CAGAAATGTC 
ATGGACTTCT ATGAAACAAG 
GAGAGCCCTG GTGACCAGGC 
GTGGGAGGAG GAGCCATAAG 
TTGCCAGGGG AGAGGAGGAG 
CAGTCATTCA CGCGTCTGAC 
GCTGGAAGGC CTCCGGATGC 
TGGAGGAGGA GGAGGGCAAA 
TGCTGGTGCT GAGTCCTGTG 
TCAGGAGGCC CGTGGGCAGA 
TCCTCTCGTG TGATGACCCT 
CCTGGCCCTG AGAGCGGCTC 
GCCCGTCCAC CCTGGAGTGG 
CCTGCTACGA CCCCCACGAT 
CCCACCAGAC AACAGCCAGG 
TTTCTCCGTG CATCGTGTCT 
CTCTTTCTCT GACCCCTCCC 
AACCCCGTCC CCCGTGTGTC 
TCCTAAGCTA TCCAAGGGGA 
TTGGATCCAG GAAGAACCCT 
ACAGGGTGCT TCTTCCTCTT 
GGGCCTGCCT CCCCAGTGAC 
AAACGGCAAC CAGGGAGTCC 
TTCCCCGACC AGAGCTTTGC 
GGAACTGAGC TCAGAGGCAG 
TGGCAGGGAT AAAAAGTCAC 
CACAATTTCT CTCCACCTGG 
TGTGGGTACT CCAGGGGCAA 
GCCAACCATG CACTCCACAA 
CCTATGTATC AGGTTAAGAG 
GCACAGCAGA GGGGAATGTA 
CCATTTCTTA GGTAAGAAGA 
GAGCCTAGCC TTCAGCAACC 
AGAAGGGCAA AGACTTCATG 
CGCTTGGTGC TGTTCTCTTT 
TCTACCTCCT GATAGGCTTG 
TGGAGGGAAG AGGACTCCTT 
TTAACCCCAT ATGGCCCAAG 
AACAAGTAAT TTACCAGTTC 
GGGCGAGGCC TAGGAAGAGG 
GCGTGGCTGC AGCCAAAACA 
AGAACCAAAA TCCAGGGCAC 
TTACTCCTCC AAAAAGAAGT 
ACTTACCAAA CCAGGAAAAC 
CAGGGTATAG CTTAGATGGT 
TCACTGGCTA ATGAGAAAAG 
CTAGGCTGGA GTTGCTTCCT 
TATGCCACAT CACACTGGGG 
TCTGTCTTCA GCCAGGGACT 
CTGTGGACCA GTATTTCAGC 
ATTGTCTTTC TCTCCTAGCC 
GTACCCTAGG GGTTCTGGAA 
TGATAGGCCC TATCCACAAG 
AAGGAGAATT GGGATTTTAG 
TGGCCTGGGG CATGTCTTGA 
CTGTTTACTC GTCACCACCT 
CCCCAGGCAC GCCACCTGCT 
TGAACTTCGT GTACTGTGTC 
TCATCTATTT TAAACACAGA 
CCGGTGTGGC TGCCTGGATG 



ACCCTCTCTT GGAGCTCCCA 
AATCGTCACA ACACAGCATG 
TGGTTGTATG GGCCCCTCCC 
CTTCTCTTGC AGCTGGCGAA 
TGCCTGCACC AACCTCAACA 
GCCCGCTCTC TCGGGCTGAG 
ATGGACCTTA TCCCTCAGGC 
ACACTGGACT AACCTGTGCC 
AGAGCTGGTG CTGCAGGCCC 
GTGCCAGCAA TGAGTTGGCA 
CCCGAGGGCC CAGCGAACTC 
TGGGGCACCC CACACGACAC 
CAGGGGAGCC GGGGCTTTGC 
ATTGAGTATC AGTAGCCACT 
TGCTGTCATC TCAAGCAGCC 
TCTTCTTCAG CCTGTTTCCC 
GTGCCGCAAG CACTGTCTCA 
GCTGGAGTCC AGGCCACTTT 
CTTCTCTGCT TTTTCTCTCT 
CTCCGGTGCG TTTCGTATCA 
CTGCTGTGTG CAGCTCGCTC 
TGGACCCAGG CTCGTGGGGA 
CCACCCTGCC TCGTGGGTGG 
CCCCCACCCC CACTGTCCCT 
CTGCGAAAGT GGAGCATCGA 
TCGAGCCTGG GGCTGCCCTA 
CCTTGCTTGG CTGCCCGCCT 
GTGCTTCAGA GAAGGAAACA 
CTCCATTCTC TACCTCCCAT 
CTCCCAAATT TAAAGATGTG 
GGAGAGCCCT GGGGTCAGTG 
AGGGGAGCAT TTGGAAATGA 
CAAGGGAGAG CTGGCCAGGG 
GCAACAGCAG GGCCTCCTAG 
GCATTTCCTC AGACTCCCAG 
AAGGTTCTCC TGGGACCCAA 
GGAAGAGAGA AGGAAGGCCC 
GGCCTTTAAG ACAAAGCGCT 
AGGGTTTGCC AACCACACTG 
CCTCCAGAGT GCTATGTTCA 
AGTAGCTCGT AGGAGGCCCT 
TACTGGGGT.T CCTGCCCACC 
GTCATTCTTA AGCCACACAT 
AAGAACTGGG TGTTGAGTAT 
TCATATGTGA AGGATAAGAA 
GGGGAAAGAA CCATCAAACC 
AGCAGGAGAG GGTGGCTCAG 
GGAAAGCAAA GGAGAGCAGG 
GAGACAGCTA ACTCTAGGAT 
TGAAGATGGG ACTCCTTGGG 
CTAGGGAAGT AGGTGATGCC 
TGAGAAGTTA TATTGGGCAG 
TTTCCCTGAA GATCAGGCAG 
CCCTCAGGAA AGAAGGACTA 
GGGAAAACAT GGAATCAGGA 
GGCCATGACT GGGAAAAGGT 
GGTGCAGCTA CGCTCACCCT 
GGCCCAGACT GTTAACCAGG 
CTGCACCTGC TGTCTTGAGA 
CCTGAGCCTC CACTATCTCC 
TCGGGTCCAT ATATGAATTG 
TGTTTACAAA ATAAAGATTA 
AGTCCTTGGG GGTAGGTCTC 
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28351 ACTCAGACCC TGGCAGTGAT GTGGGAGGGA GAGAGGCAGT GCTGGTAGAA 

28401 GCAGCTCCAG AAGCAAAGGC AACAGCAGTA GAGTGACCAC GGAAGCGGCA 

28451 AACATTGTCT TCCCTTCTCT ACCTTCCCTA GTGCCACCTG CAGGGAGGCC 

28501 CAAAGCAAAG CCCCGTTGCC CTGCATTGGG CTGGCACTGC AGAAATAAGA 

28551 TGAAACACAG TTATCGAGAG GATGCTGAAC ATCTATGAGC AGGTTTTAAA 

28601 GCCAAGATGA GTCTCATCTG TTTGTGTGGG TCAGGAACGG GTCTTCCTGA 

28651 AGGCATGAGG TGGGACTGGA TAATCTTTCA GATTTGTGAT TGGATACCTC 

28701 GGGGGAGCAG AGGCAGACTG GGATCTCAGG ACTGCAGGTA TTTCATACTT 

28751 TGGGATATGG AATTGATGGA (SEQ ID NO: 3) 
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Context : 
DNA 

Position 
206 



4963 



C3 

a 

CO 8175 

% J 

L U 
3 

w 

^ 10515 

* 

f =| 



13034 



GCTCAAGATTGCACAGCTGGTGAGTGGTGACACTGGGACTGGAACCCAAGTGTGCCTTAC 
TCCAGAGCCCTTGGCATGCACCTGAAACCCCATGTAAGCCCACTGTGGAGACGCGCACCT 
CGAAATAATGGAATCCACTACATCAGTTCCTTTAGCTTTCTGTGTAATCAGAGTAGCTAG 
CAGGCTCGGGATTTCGCCCCCCGGC 



TTTTTTTTTTTTTTTTTTTGAGACAGAGTTTTGCTCTTGTTGCCCAGGCTGGAGTGCAAT 
GGCGCAATCTCGGCTCACCGCAACCTTCGCCTCTCAGGTTCAAGCAATTCTCCTGCCTCA 
GCCTCCCGAGTAGCTGGGATTACAGGCACCGGCCACCACGCCCAGCTAATTTTTTTATAT 
TTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTTTTCCCCTCTT 
ATTATAATTCAGACACTTAACCTGAAATATACCTTTTCAAATGAAGTAAATGGGCTTACC 

TATTAAGGGACTTGGGATTCTCCCTTATCTTGGGCGTGTTTTTCAGCATTAACTAAAACT 
TAAAGGAAAGAGTTGGATGGTCAAGAAAAGCTTTTTCCTTAAGTGATATGGACAGTTTCT 
CAAGGAGGTAGAAGGGGCAGCCAGGAGACAAATCAAGGAGCCAACGAAATGAGTGCTACC 
AAGTCATAGTCATTCGCTTATTTTTAAAAAATGCGTGTCCTGTATGCCAGGCTCTGCACT 
GAGACCGAGAGATTCCAAGATGAATAATACCTACAGTCACTGTTCTCAAATTGTGCATTA 
[C,T] 

CTAAAACACATTACATGACCATGCTGGCCACTGATCGAGGCACCTTTCCCAGGGGCTTTT 
TTTGTGAATTAAGAAAACAAGGTAATTCACCAGTTATTGCCAAGATAGTTTGGCTTCTTG 
GCTCATGTGGATATCACCTAGGCCAGTACTTTTGTGATTTACTGTGTACTCCACTTTAAC 
GGCCTGCGATCTTCTAGAGAAGAACCCGCCAGGGAGCAGTGAGAGGCCTCCCTGGTAGAC 
TGAGACACTGACTGTCCCTCCCCCTATCCTTTTCGTCTTTCTGGCCAGCAGACCAGCAGG 

ATGCCAGGTGCCATGCTAAGATTTGGGGACACAGTGGTGACCAAAACAGACAGAAACCAA 
GGAGCTGGCTTACATTCCAAGGGAGTGCATAGGAAGCTGTGTTTCATTTCAGTTTCTGCT 
CTAGTACCCCCCTTTCCCTGGCAGTGCCAGGGTCTGAGAAGGAAGAGTGAGGTGGTGAGG 
AGGTGTGAAGCAGTGGGGTGACCTGAGAGGAGAGGATGGGGTGGCTTTGCCTCAAGGCTT 
GGGCCCCTGCTAGGTGTCGCTCTGCCTCAGGCCTCTGTTTCTCCTCCTGACACAGGCACA 
[G, A] 

ACTCGGCCTCCCACCCCTTCCCCAAGGACATGACCTTGGGAAGGAACATATCTGAAGCCC 
GCGGAGGGTTTCCGCTGCTGTGCATCTGTGCCACAGATCCGCAGATGCACCCACAGCTGG 
GAGCACCGGTTCCTCCCGCCTACCTGCACTCCCTGGTTTCTGTTCCTTCCTCCTCCTCCT 
TCCTTCTCCCCGCTCCCCAGACAGGCTGGTGATGAGCTTTATAACATGAAAGCTGATATT 
TGGCCATTATCCTTCTACCCTGATTGCCAGCTCTTCTCAGAGTGCCTTCTTCTGTAATCC 

CTGGTGAAGGCTTTGAAGAGGAAGTGACATTTGAGTGGAGTCTTGAAGACTAGGCAGGAT 
TCTCCAGGGGCCCTGGGTGTGGGGGAAGCACACATCCTCTTCCCTGTAGGAGGTGCTGTG 
GAGAACACCTCCAGTGGGGCTGCTACTCTTCAGCCTTGCTGGGGCCAGCTGGAGTGGCCA 
CACCATGGTCACACCAGCTGAAGTTCAAGAAGCCCCTTGCCAGGAGATTGCTTTGCTGGC 
TCTGGGTGAGGGCAGGTGCATCTGGAAGCCCCCTTCTTTCTAAGATGTTTGCTCCTGAGT 
[T,C] 

TCTATGTCCTAGTCTTTTCTTCCCTGAACCTTTTGCTACCAGTCAGCACAGCCCTGCCTG 
AGAAGGAGGCTGGAGGAGTGAGTGGTCAGTAGCCTGGTGGGTCTTGGCTGCCTCTGTGGT 
GCCCGCTGGCCTAAGTAGCAGGCTTAGGGAGGCGAGACCCAGTTCCAGGGGCTGCCAATG 
GGGAGCGAGATGGGGTGGCTGGAGCACACTGCACATGTCACCAAGGCTCTAGGGAGGTCT 
GTGCACAAGGCAGTGGGAAAAGCAAGGGGAAGACCCAGCCTGGTCAACATGGTGAAACCC 

AGATTTGGGTGAGGACACAGCCAAACCATATCAGCTCCCGGGATCCCTGTGTGAATGGGG 
TCTTTTTTGGTGTTTGAGGGCTGCACAGGGTGACCTCTTTAGAGGTGACCTCCTGCCACA 
ACCCACAGGAGGTGCACATGGCCCACACATGCTGGTTTCCTGCAGTGGGAGGGGCTGGGG 
CACTCCTGGGACCTGTGCTTGGTAACTGGAGCTGGCCTGGCCCTGGGGATTGGGTGTCTG 
CCTTGGGTTTCAGGTGTATTAGGTTGTTCCTCGTTGTGGAGTCTCATTACTAATGAAAAG 
[T,C] 

TCAGGTCGCACTGCTGGTCCTTTGGGCTGTGGTTGATCCTGGTGATAACATTTGGCACCC 
AGAGGCAGCCCTGTTTCCACTGAAGCATGCGGAGCTTGGCTGGCAGGCAGGCAAGCTGGC 
AGCTGCCCTTAACCCATGAGGTGCTGGCCCGCTAGTAGGCACACCCTACCTGTGCCAGAA 
TTGAGGTTGTAGCCAGACTCCAGGAGCCATCTGGGCCCCACAGGGGGCGGCATTTCCTCT 
TTTTGTTGAAACATTCCAGCCAAGTGCTGGCTTGGGCTTCATCTCTCTGTCCCACTCTCC 




CCCTGTGTTATGGGTTTTACACCTTATCTCACAATCTTAAAAAAAAAATTCTCTGAGAAT 
CCTCTGTCACCCCCACTTTACAGGTGAGGAAACTGAGGCAAAGATAGGCTAACTGGCTTC 
CCCAACACCATGCAGGTAATTAGTGATAAAGGCAGGGTTGGAACCAAACTTGACCTCCCA 
ATTGTGCTCTTAATGGCCAGGACACTCTGTGTCTTGAGCCACACTTCCTCCATGTTTTCT 
AGGGCTTTCTAGGGAGGCAGACAGTGATGGGAAGGGGTGTTCTTTAGTGTGGATGTGCCC 
[T,C] 

GCCTGCTCCTTTCTGTAAGCGTCACAGCACCTCCACTGCTGTACTGGGGAGGCACCAAGT 
TTTTCCCTGTTTGCCCACCCAAGGCGAGCTAGCTTAGGAGTCACGTGAGTGCTGGGTGTC 
TCGCCTGCTGCATCCCTCTATCCTGCCCCTGCCCCCGGTGCCCAGAGGAGGGCCCTGCCT 
GTCTTCCCAGTTCTCCAACAGCAGCGCTGTCCCAGCACCCTCGGGCTCCAGTTGTGGCCT 
GGCAGCTGCTGGGGCAGACACCATACAGACAGAGTCACAGCAGGAAGAGGATGGGGCCCA 

GGAAGGGGTGTTCTTTAGTGTGGATGTGCCCTGCCTGCTCCTTTCTGTAAGCGTCACAGC 
ACCTCCACTGCTGTACTGGGGAGGCACCAAGTTTTTCCCTGTTTGCCCACCCAAGGCGAG 
CTAGCTTAGGAGTCACGTGAGTGCTGGGTGTCTCGCCTGCTGCATCCCTCTATCCTGCCC 
CTGCCCCCGGTGCCCAGAGGAGGGCCCTGCCTGTCTTCCCAGTTCTCCAACAGCAGCGCT 
GTCCCAGCACCCTCGGGCTCCAGTTGTGGCCTGGCAGCTGCTGGGGCAGACACCATACAG 
[A,C] 

CAGAGTCACAGCAGGAAGAGGATGGGGCCCAGGGCTGCTGCCTCAGGCCATGGCTGCATG 
GCACCATCAGTTGATTGAGGAGCTTTTCTTGCCAATGTCTGAGGCATCAGGTGGCAGGAC 
ACGTCTCCCTGCTCTTAAGCCTCAGGCATGCAGCCCTTCTTATGCTCTCTGGGGTGAGGG 
GGAGATCCCCCTCATGGAATTGCTTTTTTTTTTTTTTTTTTTTTTTTGAGACAGGGTCCT 
GCTCTGTCACTCAGGCTGGAGTGCAGCCTCAACCTCCCAGACTCAAGTGATCCTCCTGCC 

TCTCCAACAGCAGCGCTGTCCCAGCACCCTCGGGCTCCAGTTGTGGCCTGGCAGCTGCTG 
GGGCAGACACCATACAGACAGAGTCACAGCAGGAAGAGGATGGGGCCCAGGGCTGCTGCC 
TCAGGCCATGGCTGCATGGCACCATCAGTTGATTGAGGAGCTTTTCTTGCCAATGTCTGA 
GGCATCAGGTGGCAGGACACGTCTCCCTGCTCTTAAGCCTCAGGCATGCAGCCCTTCTTA 
TGCTCTCTGGGGTGAGGGGGAGATCCCCCTCATGGAATTGCTTTTTTTTTTTTTTTTTTT 
I-iT] 

TTTTGAGACAGGGTCCTGCTCTGTCACTCAGGCTGGAGTGCAGCCTCAACCTCCCAGACT 
CAAGTGATCCTCCTGCCTCAGGCTCCCGAGTAGCTGGGACCACAGGTGGACACCATCACA 
CCTGGGTTTTTTTGTTTTTTGTTTTTTGTTTTCTAGAGATGGGGTCTCACTTTCTTGCTC 
AGTCTGGTCTCGAACTCCTGGGCGCAAGCAGTCCTCCCACCTCGTCTTCCCAAAGTGTTT 
GGATTACAGGTGTGAGCCACTGTGCTTGGCCTTTTTATTTATTTAGAATTTGTTTTGGAA 

GGATGTTTCTTCCATGACATATATAGCTCTTGAAACTACTTCTATCTAATATCACCCACA 
GTGCTGTTAAAAATACAGATTTCTGGGCCTCACCCTCAAATTATGATTCAGTAGGTCTAG 
GCACGTCAAGGTCATTGTTTTTGTCTTTGTTTTAAGTCACCCCAGGTGATTCTAAAGCCG 
AAGCTCTGCAAAGCACACCTTGAGAAACAGAGAACTCTTGTGCTCTCGCTCTCTTGACAC 
TTCAGGTGCAAAACTTTTGTCCTAATGTCGTTCTCAAACTTACGCATGTGTGAGAATCAC 
[T,C] 

GTGAGAGCTTATTGAAACTGATTGCGGGACCCCATACCTAGAGGGCCTGATTCTATAGGT 
CTGAGGTAAGGCCCAAGAATTTGCATATTTGCATTTCGTTTTCTTTTCCTTTCTTTTCTT 
TTTTTTTTTTTTTGAGATGAAGTCTCACCCTGTCGCCCAGACTGGAGTGCAGTGGCATGA 
TCTCAGCTCACTGCAGCCTCTGCCTCCTGGGTTAAAGCGATTCTCCCCACACCCCAGACC 
CGCTCCTGAGTAGCTGGGATTACAGGTGCCCGCCACCATGACTAGCTAACGTTTGTATTT 

AGGCACGTCAAGGTCATTGTTTTTGTCTTTGTTTTAAGTCACCCCAGGTGATTCTAAAGC 
CGAAGCTCTGCAAAGCACACCTTGAGAAACAGAGAACTCTTGTGCTCTCGCTCTCTTGAC 
ACTTCAGGTGCAAAACTTTTGTCCTAATGTCGTTCTCAAACTTACGCATGTGTGAGAATC 
ACTGTGAGAGCTTATTGAAACTGATTGCGGGACCCCATACCTAGAGGGCCTGATTCTATA 
GGTCTGAGGTAAGGCCCAAGAATTTGCATATTTGCATTTCGTTTTCTTTTCCTTTCTTTT 
[C,T] 

GATCTCAGCTCACTGCAGCCTCTGCCTCCTGGGTTAAAGCGATTCTCCCCACACCCCAGA 
CCCGCTCCTGAGTAGCTGGGATTACAGGTGCCCGCCACCATGACTAGCTAACGTTTGTAT 
TTTTAGTAGAGACGGGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCA 
GGTGATCCACTCACCTCAGCCTCCCAAGGTCTTGGGATTACTGGTGTGAGCCACCGCGTG 

TGCAGCCTCTGCCTCCTGGGTTAAAGCGATTCTCCCCACACCCCAGACCCGCTCCTGAGT 
AGCTGGGATTACAGGTGCCCGCCACCATGACTAGCTAACGTTTGTATTTTTAGTAGAGAC 
GGGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACTCA 
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CCTCAGCCTCCCAAGGTCTTGGGATTACTGGTGTGAGCCACCGCGTGCGGCCAGAATTTG 
CATTTCTAACAAGTCCCAGGTGATGCTGATGCTGTGGGTCCAGGGACACACTTTGAGAAC 
[T,C,A] 

GCTTGTTACTCAGGCGATATGTGGACAGTAGCGTCATCTTCACCTGGGAGCTTCCTGCAG 
CATCTCAGGCCTTGCCCTACACCTACCAGATCAGAATCTGCATTTTAACTCAATCCCCGC 
GTGATTCTCATGCACCTGGAAGTTTGAGAAATATGACCTTAGAGGAGCCGGAATGTGAAA 
CCACTGGAGGCAGAGATAGATGGAGAATATCTCTTCTTCTCACGGATACTAAAGATGCAA 
CAAAAAGGGCTGACTCTCTGGGTGTGCACCCAGGTGGGGCTGATGACCGAAAAGAGGCCA 

TGTGTGTGAGGCCGGGGAGTGCTGCGAGCCCCGGAATTCCTCAGCCTTAGTCCCCCGCCA 
CATAGCTAAGAAGTGAGGGAGGAGGTGAGAAGGAGTCACTGCCCAGCCTCACTTCCGGTG 
GAGTACCCTGTCTCCTTGTCAGTTCTGTCTCTGGGGACAGTTGCCTGCTTTCACCTCTCC 
CTCCATCCCCTCTTCTCTCACAGGGAAAAATTCACCTTAATATTGGAAGTTCCTCTCCTA 
GCAAAGTCCTTCTCAGGCACCCACAGGCAAAAAGGAAACTAAGCAGAGTTAGGGCTTCCA 
[G, T] 

GCCTAGCCAACTACACGACTCTCCTCTTGCTTCCCTAAGAACCAGCGCAAGGGGCAGCGT 
GGGTTCCAGCATAGATGGACCTGTGTTGGAATCTCTGCACGTGCTGTGCTGACCCTGGCT 
AGCCATTGACCTCTCTGAGCCCTTGTTTCCTTTCCACTAGGCTCTCTGAGGGCAGGGGCC 
ATGTCTTTTTCACTGCTCTGTCTGCACTGAGCACTGTGCAGGGCACATAGGAAGTTCCCA 
TAAATGTTTGTGGGATAAAGGAAATAAAACCTTCTCTCTTCCTGTCCCCCTTGTGATGGC 

AAAGTCCTTCTCAGGCACCCACAGGCAAAAAGGAAACTAAGCAGAGTTAGGGCTTCCAGG 
CCTAGCCAACTACACGACTCTCCTCTTGCTTCCCTAAGAACCAGCGCAAGGGGCAGCGTG 
GGTTCCAGCATAGATGGACCTGTGTTGGAATCTCTGCACGTGCTGTGCTGACCCTGGCTA 
GCCATTGACCTCTCTGAGCCCTTGTTTCCTTTCCACTAGGCTCTCTGAGGGCAGGGGCCA 
TGTCTTTTTCACTGCTCTGTCTGCACTGAGCACTGTGCAGGGCACATAGGAAGTTCCCAT 
[A,G] 

AATGTTTGTGGGATAAAGGAAATAAAACCTTCTCTCTTCCTGTCCCCCTTGTGATGGCTT 
TGCACAAGGCACTGTCCTTGGCCAGGTTTGCTAGGCTAGTGTGAGGATAAACCAGGTATA 
TTACAAATTGGAGAAAATTTCTCGTTCTTCTTGGAAGAAGGTGCTGTATCATGAAACAAG 
AATGTCTTGATTCCCTTCTATGCCAGGTACTGGGGAGAAACAGGTGCCTGATAACCGTTG 
ATCCAGGCAGAAATAAGCATACTCCTGCTTCCCAAGGCCTGATGCTTCTCTCCTTCCTCC 

CCTTGGATGAAGAAGCGTGGGAACTCTTTGCTTCCTTTCCCTCCCGCAGTGACATGCCAT 
GCCATGCCACTGCCTCTTCATCTGGTCCTATGACAGTCACTCATAAGCACCCGCATGTAC 
CCGGCCCTGCACTAGCTCATGACAGCTGCAGTCAATTGGGCCAGGTGCTGTATCTCATCC 
GGCCTCCTCAGCAACCCTCTGAGATACTGGTAATGTCCCTGATGAAGATATTTACTGAGG 
CAGAAATGGACGCTCAGTGAAGCAAGGTGCCTGATGTTATAGCAATGAGCTATGAGTGGC 
[C,T] 

AGAGGGAGGAGATAAGCTCAGGCCTGACACCAAAGCCCATGCTCCTTCTAGTCAACCACA 
GTGCCTCCTATGGTGAATGAGTGAGTCAGCAACCAAGACGCATGAGGCCTTCTTTTTGGT 
GAGCCTTGGCTGGGTGCTGAGGCTTCAGGTACAATCATGGGTTGGAAGAGCCCTCCTCTC 
TCTCCACAGTCTGGCACTATGACCCCTTCTGGTTATTAACAAGGCAAAGAGAGAGAGGGA 
AGAAAGCAGGCAAATAATGTGGGTTGCTATTCCTAGAGATTAGAATTTCAGGAAGGATAA 

TTCTCTGACCCCTCCCCTCCGGTGCGTTTCGTATCAAAGCTCCTCAAACCCCGTCCCCCG 
TGTGTCCTGCTGTGTGCAGCTCGCTCTTTCCTTCCTTCCTAAGCTATCCAAGGGGATGGA 
CCCAGGCTCGTGGGGAGGTTCCACCCTTGGATCCAGGAAGAACCCTCCACCCTGCCTCGT 
GGGTGGGCCAAAGGCTACAGGGTGCTTCTTCCTCTTCCCCCACCCCCACTGTCCCTCATG 
TGCCATGGGCCTGCCTCCCCAGTGACCTGCGAAAGTGGAGCATCGAGGTAGGAGGGAAAC 
[G, A] 

GCAACCAGGGAGTCCTCGAGCCTGGGGCTGCCCTACCTCTACCCATTCCCCGACCAGAGC 
TTTGCCCTTGCTTGGCTGCCCGCCTGCCTCTTTGGGGAACTGAGCTCAGAGGCAGGTGCT 
TCAGAGAAGGAAACAAAATGAGGGGTGGCAGGGATAAAAAGTCACCTCCATTCTCTACCT 
CCCATGCAGCATGAACACAATTTCTCTCCACCTGGCTCCCAAATTTAAAGATGTGGACCA 
AGGCCTGTGGGTACTCCAGGGGCAAGGAGAGCCCTGGGGTCAGTGACACTGTCAGGCCAA 

ACCCCTCCCCTCCGGTGCGTTTCGTATCAAAGCTCCTCAAACCCCGTCCCCCGTGTGTCC 
TGCTGTGTGCAGCTCGCTCTTTCCTTCCTTCCTAAGCTATCCAAGGGGATGGACCCAGGC 
TCGTGGGGAGGTTCCACCCTTGGATCCAGGAAGAACCCTCCACCCTGCCTCGTGGGTGGG 
CCAAAGGCTACAGGGTGCTTCTTCCTCTTCCCCCACCCCCACTGTCCCTCATGTGCCATG 
GGCCTGCCTCCCCAGTGACCTGCGAAAGTGGAGCATCGAGGTAGGAGGGAAACGGCAACC 
[A, G] 
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GGGAGTCCTCGAGCCTGGGGCTGCCCTACCTCTACCCATTCCCCGACCAGAGCTTTGCCC 
TTGCTTGGCTGCCCGCCTGCCTCTTTGGGGAACTGAGCTCAGAGGCAGGTGCTTCAGAGA 
AGGAAACAAAATGAGGGGTGGCAGGGATAAAAAGTCACCTCCATTCTCTACCTCCCATGC 
AGCATGAACACAATTTCTCTCCACCTGGCTCCCAAATTTAAAGATGTGGACCAAGGCCTG 
TGGGTACTCCAGGGGCAAGGAGAGCCCTGGGGTCAGTGACACTGTCAGGCCAACCATGCA 

GCCAGGGACTTGAGAAGTTATATTGGGCAGTGGCTCCAATCTGTGGACCAGTATTTCAGC 
TTTCCCTGAAGATCAGGCAGGGTGCCATTCATTGTCTTTCTCTCCTAGCCCCCTCAGGAA 
AGAAGGACTATATTTGTACTGTACCCTAGGGGTTCTGGAAGGGAAAACATGGAATCAGGA 
TTCTATAGACTGATAGGCCCTATCCACAAGGGCCATGACTGGGAAAAGGTATGGGAGCAG 
AAGGAGAATTGGGATTTTAGGGTGCAGCTACGCTCACCCTAAACTTTTGGTGGCCTGGGG 
[C,T] 

ATGTCTTGAGGCCCAGACTGTTAACCAGGCTCTGCTGGCCTGTTTACTCGTCACCACCTC 
TGCACCTGCTGTCTTGAGACTCCATCCAGCCCCAGGCACGCCACCTGCTCCTGAGCCTCC 
ACTATCTCCCTGTGACGGGTGAACTTCGTGTACTGTGTCTCGGGTCCATATATGAATTGT 
GAGCAGGGTTCATCTATTTTAAACACAGATGTTTACAAAATAAAGATTATTTCAAACCAC 
CGGTGTGGCTGCCTGGATGAGTCCTTGGGGGTAGGTCTCACTCAGACCCTGGCAGTGATG 

GGCAGTGGCTCCAATCTGTGGACCAGTATTTCAGCTTTCCCTGAAGATCAGGCAGGGTGC 
CATTCATTGTCTTTCTCTCCTAGCCCCCTCAGGAAAGAAGGACTATATTTGTACTGTACC 
CTAGGGGTTCTGGAAGGGAAAACATGGAATCAGGATTCTATAGACTGATAGGCCCTATCC 
ACAAGGGCCATGACTGGGAAAAGGTATGGGAGCAGAAGGAGAATTGGGATTTTAGGGTGC 
AGCTACGCTCACCCTAAACTTTTGGTGGCCTGGGGCATGTCTTGAGGCCCAGACTGTT/^A 
[C,G] 

CAGGCTCTGCTGGCCTGTTTACTCGTCACCACCTCTGCACCTGCTGTCTTGAGACTCCAT 
CCAGCCCCAGGCACGCCACCTGCTCCTGAGCCTCCACTATCTCCCTGTGACGGGTGAACT 
TCGTGTACTGTGTCTCGGGTCCATATATGAATTGTGAGCAGGGTTCATCTATTTTAAACA 
CAGATGTTTACAAAATAAAGATTATTTCAAACCACCGGTGTGGCTGCCTGGATGAGTCCT 
TGGGGGTAGGTCTCACTCAGACCCTGGCAGTGATGTGGGAGGGAGAGAGGCAGTGCTGGT 

CTGCTGGCCTGTTTACTCGTCACCACCTCTGCACCTGCTGTCTTGAGACTCCATCCAGCC 
CCAGGCACGCCACCTGCTCCTGAGCCTCCACTATCTCCCTGTGACGGGTGAACTTCGTGT 
ACTGTGTCTCGGGTCCATATATGAATTGTGAGCAGGGTTCATCTATTTTAAACACAGATG 
TTTACAAAATAAAGATTATTTCAAACCACCGGTGTGGCTGCCTGGATGAGTCCTTGGGGG 
TAGGTCTCACTCAGACCCTGGCAGTGATGTGGGAGGGAGAGAGGCAGTGCTGGTAGAAGC 
[A, G] 

GCTCCAGAAGCAAAGGCAACAGCAGTAGAGTGACCACGGAAGCGGCAAACATTGTCTTCC 
CTTCTCTACCTTCCCTAGTGCCACCTGCAGGGAGGCCCAAAGCAAAGCCCCGTTGCCCTG 
CATTGGGCTGGCACTGCAGAAATAAGATGAAACACAGTTATCGAGAGGATGCTGAACATC 
TATGAGCAGGTTTTAAAGCCAAGATGAGTCTCATCTGTTTGTGTGGGTCAGGAACGGGTC 
TTCCTGAAGGCATGAGGTGGGACTGGATAATCTTTCAGATTTGTGATTGGATACCTCGGG 

GCACGCCACCTGCTCCTGAGCCTCCACTATCTCCCTGTGACGGGTGAACTTCGTGTACTG 
TGTCTCGGGTCCATATATGAATTGTGAGCAGGGTTCATCTATTTTAAACACAGATGTTTA 
CAAAATAAAGATTATTTCAAACCACCGGTGTGGCTGCCTGGATGAGTCCTTGGGGGTAGG 
TCTCACTCAGACCCTGGCAGTGATGTGGGAGGGAGAGAGGCAGTGCTGGTAGAAGCAGCT 
CCAGAAGCAAAGGCAACAGCAGTAGAGTGACCACGGAAGCGGCAAACATTGTCTTCCCTT 
[C,G] 

TCTACCTTCCCTAGTGCCACCTGCAGGGAGGCCCAAAGCAAAGCCCCGTTGCCCTGCATT 
GGGCTGGCACTGCAGAAATAAGATGAAACACAGTTATCGAGAGGATGCTGAACATCTATG 
AGCAGGTTTTAAAGCCAAGATGAGTCTCATCTGTTTGTGTGGGTCAGGAACGGGTCTTCC 
TGAAGGCATGAGGTGGGACTGGATAATCTTTCAGATTTGTGATTGGATACCTCGGGGGAG 
CAGAGGCAGACTGGGATCTCAGGACTGCAGGTATTTCATACTTTGGGATATGGAATTGAT 
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